Orphan receptor

ABSTRACT

PCT No. PCT/EP96/03933 Sec. 371 Date May 8, 1997 Sec. 102(e) Date May 8, 1997 PCT Filed Sep. 9, 1996 PCT Pub. No. WO97/09348 PCT Pub. Date Mar. 13, 1997This invention relates to a novel estrogen receptor-related nuclear receptor, hereinafter termed &#34;ER beta &#34; having the amino acid sequence of FIGS. 1, 13A or 14A or substantially the same ambino acid sequence as the amino acid sequence shown in FIGS. 1, 13A or 13B or an amino acid sequence functionally similar to that sequence. The invention also relates to DNA sequences encoding the receptor. The receptor may be useful in isolating molecules for the treatment of disorders such as prostate cancer, benign prostatic hyperplasia, osteoporosis or cardiovascular disorders and in the testing of substances for estrogenic and other hormonal effects.

This invention relates to cellular nuclear receptors and their uses.

A large family of nuclear receptors which confer cells with responsiveness to molecules such as retinoid acid, vitamin D, steroid hormones and thyroid hormones has been identified. Extensive studies have shown that the members of this superfamily of nuclear receptors activate and/or repress gene transcription through direct binding to discrete cis-acting elements termed "hormone response elements" (HRE). It has been shown that these HRE's comprise repeats of consensus palindromic hexanucleotide DNA motifs. The specificity of the HRE's is determined by the orientation of, and spacing between, halfsites (i.e. half a palindromic sequence)(Umenesono K., et al, 1991 Cell 65, 1255-1266).

Specific DNA binding is mediated by a strongly-conserved DNA binding domain, containing two zinc fingers, which is conserved among all thus discovered nuclear receptors. Three amino acids at the C-terminal base of the first zinc finger (known as the "P-box") are important for the recognition of the half site nucleotide sequence. Members of the nuclear receptor superfamily have been classified into different groups on the basis of the amino acid sequence within the P box.

All members of the nuclear receptor superfamily also contain a hypervariable N-terminal domain and a ligand-binding domain containing some "patches" of conserved sequence. One of these is called the "Ti-domain".

Molecules which are thought to be nuclear receptors, as they are structurally related to cliaracterised receptors, but for which no ligand has been found, are termed "orphan receptors". Many such orphan receptors have been identified (see for example Evans R. M. (1988) science 240,889-895 and O'Malley, B. (1990) Mol. Endocrinol. 4 363-369)

We have now unexpectedly identified, initially in rat a new orphan receptor, which is related to the known estrogen receptor ERα, and which we have designated "ERβ" (specifically "rERβ" in rat). In this specification "ERβ" will be used to refer to the receptors hERβ or rERβ or related receptors. The nucleotide and amino acid sequences of rERβ have now been determined and are shown in FIG. 1. We have also identified a human ERβ--"hERβ", the amino acid DNA and sequences of which are shown in FIG. 13A and 13B respectively.

According to one aspect of the invention there is provided a novel estrogen receptor-related nuclear receptor, hereinafter termed "ERβ" having the amino acid sequence of FIGS. 1, FIG. 13A or 14A or substantially the same amino acid sequence as the amino acid sequence shown in FIGS. 1, 13A or 14A or an amino acid sequence functionally similar to those sequences. The isolated receptor may be particularly useful in the search for molecules for use in treatment of diseases or conditions such as cardiovascular diseases, central nervous system diseases or conditions or osteoporosis, prostate cancer or benign prostatic hyperplasia.

The receptor of the invention may also be used in the testing of environmental chemicals for estrogenic activity. There has been increasing concern over the effect of various chemicals released into the environment on the reproduction of humans and animals. Threats to the reproductive capabilities of birds, fish, reptiles, and some mammals have become evident and similar effects in humans have been proposed. Substantial evidence is now emerging which shows that exposure to certain chemicals during critical periods of foetal life may distort the development of the reproductive organs and the immune and nervous systems. On the basis of possible parallels between actual wildlife effects, seen for example in birds and seals living in highly polluted areas, and proposed effects in humans, in combination with documented human reproductive effects caused by prenatal exposure to the pharmaceutical estrogen, diethyl stilbestrol (DES), "estrogenic" chemicals have been proposed to threaten the reproductive capability of both animals and humans. Among the chemicals known or suspected to act as estrogen mimics on the human body, or in other ways disturb the human endocrine system, there are several which have already been identified as environmental hazards. Among the chemicals that have been mentioned as potential causes of disruption of reproductive function in animals and humans are chlorinated organic compounds such dieldrin, endosulfans, chlordanes, endrins, aldrin, DDT and some PCBs, plastics such as Bisphenol A, phthalates and nonylphenol, and aromatic hydrocarbons. Some of the proposed effects on humans have been suggested to be due to an increasing exposure to environmental estrogens--in fact, exposure to chemical compounds to which higher organisms during the foetal period react in a way that is similar to when they are exposed to high dosages of estrogens. The effects are manifested by for example perturbations of the sex characteristics and impaired reproductive potential. In humans, elevated risks of breast cancer and other hormone-related disease has also been discussed as possible effects. In addition, to the documented "estrogcnic" effects, it has recently been demonstrated that environmental pollutants may also act on hormonal pathways other than the estrogenic pathway--it has been shown that p,p'--DDE the main metabolite of DDT (also in humans) is a fairly anti-androgenic agent (Kelce W. R. et al Nature 1995 375:581-585). Epidemiological studies on these issues are, however, presently difficult to interpret. Nevertheless, there is a growing opinion against these potentially hormone disrupting chemicals, and very palpable public and environmental demand for the governmental agencies and industry to act. In view of the similarities between the receptor of the present invention, ERβ and the classical estrogen receptor, ERβ may be used in the testing of chemicals for estrogenic effect.

An amino acid sequence functionally-similar to the sequence shown in FIGS. 1, 13A or 14A may be from a different mammalian species.

An amino acid sequence which is more than about 89%, identical with the sequence shown in FIGS. 1, 13A or 14A is substantially the same amino acid sequence for the purposes of the present application. Preferably, the amino acid sequence is more than about 95identical with the sequence shown in FIGS. 1, 13A or 14A.

According to another aspect of the invention there is provided a DNA sequence encoding a nuclear receptor according to the first aspect of the invention. Preferably, the DNA sequence is that given in FIGS. 1, 13A or 14A or is a DNA sequence encoding a protein or polypeptide having the functionality of ERβ.

ERβ is unique in that it is extremely homologous to the rat estrogen receptor, in particular in its DNA binding domain. It appears that ERβ has a very limited tissue distribution. In female rats, it appears to be present only in the ovaries, and in male rats in the prostate and testes. As these tissues are classic targets for estrogen action, it can be deduced that ERβ may mediate some of the effects of estrogen.

The different ligand specificity of ERα and ERβ may be exploited to design pharmaceutical agents which are selective for either receptor. In particular, the differences in ligand specificity may be used to develop drugs that specifically target cardiovascular disease in postmenopausal women or osteoporosis.

The nuclear receptor of the invention, ERβ, a method of producing it, and tests on its functionality will now be described, by way of example only, with reference to the accompanying drawings. FIGS. 1 to 15 in which:

FIG. 1 shows the amino acid sequence of ERβ (SEQ ID NO:2) and the nucleotide sequence of the gene encoding it (SEQ ID NO:1);

FIG. 2A is a phylogenetic tree showing the evolution of ERβ and other receptors;

FIG. 2B shows the homology between the different domains in ERβ and certain other receptors;

FIG. 2C is an alignment of the amino acid sequence in the ligand binding domains of rERβ, (SEQ ID NO:7) rERα, (SEQ ID NO:8) mERα (SEQ ID NO:9) and hERα (SEQ ID NO:10);

FIG. 2D is an alignment of the amino acid sequence in the DNA binding domains of rERβ(SEQ ID NO:11), rERα, mERα and hERα (SEQ ID NO:12);

FIG. 3A is a film autoradiograph of prostate gland showing strong expression of a clone of the receptor of the invention clone 29;

FIG. 3B is a darkfield image showing prominent signal for clone 29 in epithelium (e) of prostatic alveoli. The stroma(s) exhibit(s) weaker signal;

FIG. 3C is a bipolarization image of cresyl violet counterstained section showing silver gains over epithelium (e) whereas the stroma(s) contain(s) less grains;

The bar represents 0.7 mm for FIG. 3A, 200 μm for FIG. 3B and 30 μm for FIG. 3C;

FIG. 4A shows a film autoradiograph of ovary showing strong expression of clone 29 in follicles at different developmental stages (some are indicated by arrows). The interstitial tissue (arrowheads) shows low signal;

FIG. 4B shows a darkfield image showing high expression of clone 29 in granular cells of primary (1) secondary (2), tertiary (3) and mature (4) follicles. Low signal is present in interstitial tissue (it);

FIG. 4C is a bipolarization image of ovary a showing strong signal in granular cells (gc), whereas the oocyte (oc) and the cainterna (ti) are devoid of clear signal;

The bar represents 0.9 mm for FIG. 4A, 140 μm for FIG. 4B and 50 μm for FIG. 4C;

FIG. 5A illustrates the results of saturation ligand binding analysis of cloned ERβ;

FIG. 5B illustrates the specificity of ligand binding by cloned ERβ;

FIG. 5C illustrates E2 binding by ERβ;

FIG. 6 illustrates the activation of transcription by cloned ERβ;

FIGS. 7 and 7A illustrates stimulation by various ligands by cloned ERβ;

FIG. 8 illustrates the results of RT-PCR experiments on the expression on rat estrogen receptors:

FIG. 9 illustrates the results of RT-PCR experiments on the expression of human ERβ (hERβ);

FIG. 10A is a Hill plot comparing binding of ¹²⁵ I-E2 by hERα and rERβ,

FIG. 10B is a Scatchard plot comparing binding of ¹²⁵ I-E2 by hERα and rERβ;

FIG. 11A illustrates the relative binding affinity of hERα and rERβ for various ligands;

FIG. 11B is a detail of FIG. 12A;

FIG. 12 is an alignment of various estrogen receptors; Rat ERβ (SEQ ID NO:13), Mouse ERβ (SEQ ID NO:14), Human ERβ from testis (SEQ ID NO:15), Rat ERα (SEQ ID NO:16), Human ERα (SEQ ID NO:17), Human ERR1 (SEQ ID NO:18), and Human ERR2 (SEQ ID NO:19)

FIG. 13A shows the amino acid sequence of human ERβ; (SEQ ID NO:3)

FIG. 13B shows the DNA sequence of human Erβ; (SEQ ID NO:4)

FIG. 14A shows the amino acid sequence of mERβ; (SEQ ID NO:5)

FIG. 14B shows the DNA sequence of mouse ERβ; (SEQ ID NO:6) and

FIG. 15 illustrates ligand binding affinities for various phytoestrogens by ER's of the invention.

A. CLONING OF RAT ERβ

1. PCR-Amplification and Complementary DNA Cloning

A set of degenerate primers (DBD 1,2,3 and WAK/FAK) were designed previously according to the most highly conserved sequences of the DNA-binding domain (P-box) and ligand binding domain (Ti-stretch) of members of the nuclear receptor family (Enmark, E., Kainu, T., Pelto-Huikko, M., & Gustafsson, J-Å (1994) Biochem. Biophys. Res. Commun. 204, 49-56). Single strand complementary DNA reverse transcribed from rat prostate total RNA was employed with the primers in PCR reactions as described in Enmark, E., Kainu, T., Pelto-Huikko, M., & Gustafsson, J-Å (1994) Biochem. Biophys. Res. Commun. 204, 49-56. The amplification products were separated on a 2% low melting agarose gel and DNA products between 400 and 700 bp were isolated from the gel and ligated to TA cloning vector (Invitrogen). As alternatives, we also used the RP-I/RP-2 and DBD66-100/DBD210-238 primer sets in the DNA-binding domain of nuclear receptors exactly as described by Hirose T., Fijimoto, W., Yamaai, T., Kim, K. H., Matsuura, H., & Jetten, A. M (1994) Mol. Endocrinol. 8, 1667-1677 and Chang, C., Lopes Da Silva, S., Ideta, R., Lee, Y., Yeh, S., & Burbach, J. P. H. (1994) Proc. Natl. Acad. Sci. 91, 6040-6044 respectively. Clone number 29 (obtained with the DBD-WAK/FAK set) with a length of 462 bp showed high homology (65%) with the rat estrogen receptor cDNA (65%), which was previously cloned from rat uterus (Koike. S., Sakai, M., & Muramatsu, M. (1987) Nucleic Acids Res 15, 2499-2513). The amino acid residues predicted by clone 29 DNA sequences suggested that this DNA fragment encoded part of the DNA-binding domain, hinge region and the beginning of the ligand binding domain of a novel member of the nuclear receptor family. Two PCR primers (FIG. 1) were used to generate a probe of 204 bp consisting of the hinge region of the novel receptor, which was used to screen a rat prostate cDNA library (Clontech gt10) under stringent conditions resulting in four strongly positive clones with a size of 0.9 kb, 18 kb, 2.5 kb and 5-6 kb respectively. The clone of 2.5 kb was sequenced and FIG. 1 shows the nucleotide sequence determined in the core facility (CyberGene AB) by cycle sequencing using fluorescent terminators (Applied Biosystems) on both strands, with a series of internal primers and deduced amino acid sequence of clone 29. Two in frame ATG codons are located at nucleotide 424 and nucleotide 448, preceding by an in-frame stop codon at nucleotide 319, which suggests that they are possible start codons. The open reading frame encodes a protein of 485 amino acid residues (counted from the first methionine) with a calculated molecular weight of 54.2 kDa. Analysis of the proteins by synthesized by in-vitro translation from the clone 29 cRNA in rabbit reticulocyte lysate revealed a doublet protein band nigrating at approximately 57 kDa on SDS-PAGE gels (data not shown), confirming the open reading frame. The doublet protein band is probably caused by the use of both ATG codons for initiation of protein synthesis. The amino acid sequence of clone 29 protein shows the characteristic zinc module DNA-binding domain, hinge region and a putative ligand binding domain, which are the characteristic features of members of the nuclear receptor family (Tsai, M-J., & O'Malley, B.W (1994) Ann. Rev. Biochem. 63, 451-486; Hard, T., & Gustafsson, J-Å (1993) Acc. Chem. Res. 26, 644-650; Laudet, V., Hanni, C., Coli, J., Catzeflis, F., & Stehelin, D (1992) EMBL J. 11, 1003-1012).

Protein sequence comparison with several representative members of the nuclear receptor family (FIG. 2) showed the clone 29 protein is most related to the rat estrogen receptor (ERα), cloned from uterus (Koike, S., Sakai, M., & Muramatsu, M. (1987) Nucleic. Acids Res. 15, 2499-2513), with 95% identity in the DNA-binding domain (amino acid residues 103-167) (Griffiths, K., Davies, P., Eaton C. I., Harper, M. E., Turkes, A., & Peeling, W. B. (1991) in Endocrine Dependent Tumours, eds. Voigt, K-D. & Knabbe, C. (Raven Press), pp. 83-125). A number of functional characteristics have been identified within the DNA-binding, domain of nuclear receptors (Hard, T., & Gustafsson, J-Å. (1993) Acc. Chem. Res 26 644-650 and Zilliacus, J., Carlstedt-Duke, J., Gustafsson, j-Å., & Wright, A. P. H. (1994) Proc. Natl. Acad. Sci. USA 91, 4175-4179). The so-called P-box specifies nucleotide sequence recognition of the core half-site within the response element, while the D- box mediates dimerization between receptor monomers. The clone 29 protein P-box and D-box sequences of EGCKA and PATNQ, respectively, are identical to the corresponding boxes in ERα (Hard, T., & Gustafsson. J-Å. (1993) Acc. Chem. Res 26, 644-650 and Koike, S., Sakai, M., & Muramatsu, M. (1987) Nucleic Acids Res. 15, 2499-2513), thus predicting that clone 29 protein binds to "estrongen response element" (ERE) sequences.

The putative ligand binding domain (LBD) of clone 29 protein (amino acid residues 259-457) shows closest homology to the LBD of the rat ERα (FIG. 2), while the homology with the human ERR1 and ERR2 proteins (Giguere, V., Yang, N., Segui. P., & Evans R. M. (1988) Nature 331, 91-94) is considerably less. With the human, mouse and xenopus estrogen receptors the homology in the LBD is also around 55%, while the homology with the LBD of other steroid receptors is not signiticant (FIG. 2). Cysteine residue 530 in human ERα has been identified as the covalent attachment site of an estrogenic affinity label (Harlow, K. W., Smith D. N., Katzenellenbogen, J. A., Greene, G. L., & Katzenellenbogen, B. S. (1989)J. Biol. Chem. 264, 17476-17485). Interestingly, clone 29 protein (Cys-436) as well as the mouse, rat and xenopus ERαs have a cysteine residue at the corresponding position. Also, two other amino acid residues described to be close to or part of the ligand-binding pocket of the human ERα-LBD (Asp 426 and Gly 521) are conserved in the LBD of clone 29 protein (Asp 333 and Gly 427) and in the LBD of ERαs from various species (20,21). The ligand-dependent transactivation function TAF-2 identified in ERα (Danielian, P. S., White, R., Lees, J. A., & Parker, M. G. (1992) EMBO J. 11, 1025-1033), which is believed to be involved in contacting other transcription factors and thereby influencing activation of transcription of tarteg genes, is almost completely conserved in clone 29 protein (amino acid residues 441-457). Steroid hormone receptors are phosphoproteins (Kuiper, G. & Brinkmann, A. O. (1994) Mol. Cell. Endocrinol. 100, 103-107), and several phosphorylation sites identified in the N-terminal domain and LBD of ERα (Arnold, S. F., Obourn, J. D., Jaffe, H., & Notides. A. C. (1995) Mol. Endocrinol 9, 24-33 and Le Goff, P., Montano, M. M., Schodin, D. J., & Katzeiiellenbogen, B. S (1994) J Biol. Chem. 269, 4458-4466) are conserved in clone 29 protein (Ser 30 and 42, Tyr 443). Clone 29 protein consists of 485 amino acid residues while ERαs from human, mouse and rat consist of 590-600 amino acid residues. The main difference is a much shorter N-terminal domain in clone 29 protein i.e 103 amino acid residues as compared to 185-190 amino acid residues in the other receptor proteins. Also the non-conserved so-called F-domain at the C-terminal end of ERαs is 15 amino acid residues shorter in clone 29 protein. The cDNA insert of a positive clone of 2.6 kb was subcloned into the EcoRI site of pBluescript (trademark) (Stratagene). The complete DNA sequence of clone 29 was determined (CyberGene AB) by cycle sequencing using fluorescent terminators (Applied Biosystems) on both strands, with a series uf internal primers.

FIGS. 2C and 2D respectively compare the ligand and DNA binding domain of ERα compared to rat, mouse and human Erα's.

2. Saturation Ligand Binding Analysis and Ligand Competition Studies

Clone 29 cDNA was subcloned in pBluescript downstream of the T7 promoter to give p29-T7. Clone 29 protein was synthesized in vitro using the TnT-coupled reticulocyte lysate system (Promega). Translation reaction mixtures were diluted five times with TEDGMo buffer (40 mm Tris/HCl, pH 7.4, 1 mM EDTA, 10% (v/v) glycerol, 10 mM Na₂ MoO₄, 10 mM DTT) and 0.1 ml aliquots were incubated for 16 h at 8° C. with 0.3-6.2 nM 2,4,6,7-³ H!-17β-estradiol (NEN-Dupont; specific radioactivity 85 Ci/mmol) in the presence or absence of a 200-fold excess of unlabelled E2.

FIG. 5A illustrates the results of a saturation ligand analysis of clone 29 protein. Reticulocyte lysate containing clone 29 protein was incubated with 6 concentrations of ³ H!E2 between 0.3 and 6.0 nM. Parallel tubes contained an additional 200 fold of non-radioactive E2. Bound and free ligand were separated with a dextran-coated charcoal assay. The Kd (0.6 nM) was calculated from the slope of the line in the Scatchard plot shown (r=0.93), and the number of binding sites was extrapolated from the intercept on the abscissa (Bmax=1400 fmol/ml undiluted translation mixture).

For ligand competition studies diluted reticulocyte lysate was incubated with 5 nM 2.4,6.7-³ H!-17β-estradiol in the presence of either 0, 5, 50, 500 or 5,000 nM of non- radioactive E2, estrone, estriol, testosterone, progesterone, corticosterone, 5α-androstane-3β,17β-diol, 5α-androstane-3α, 17β-diol and diethylstilbestrol (DCES) for 16 h at 8° C. Bound and unbound steroids were separated with a dextran-coated charcoal assay (Ekman, P., Barrack, E. R., Greene, G. L., Jensen, E. V., & Walsh, P. C. (1983) J. Clin. Endocrinol Metab. 57, 166-176).

FIG. 5B illustrates the specificity of ligand binding by clone 29 protein. Reticulocyte lysate containing clone 29 protein was equilibrated for 16 h with 5 nM ³ H!E2 and the indicated fold excess of competitors. Data represent ³ H!E2 bound in the presence of unlabelled E2, testosterone (T), progesterone (prog), corticosterone (cortico), estrone (E1), diethylstilbestrol (DES), 5α-androstane-3α, 17β-diol (3α-AD), 5α-androstane-3β,17β-diol (3β-AD) and estriol (E3). ³ H!E2 binding in the absence of competitor was set at 100%.

3. In-situ Hybridisation

In- situ hybridisation was carried out as previously described (Dagerlind Å., Friber, K., Bean, A. J., & Hokfelt, T (1992) Histochemistry 98, 39-49). Briefly, two oligonucleotide probes directed against nucleotides 994-1041 and 1981-2031 were each labelled at the 3'-end with ³³ P-dATP using terminal deoxynucleotidyltransferase (Amersham, UK). Adult male and female Sprague-Dawley rats (age 2 to 3 months n=10) were used for this study. The rats were decapitated and the tissues were rapidly excised and frozen on dry ice. The tissues were sectioned in a Microm HM500 cryostat at 14 μm and thawed onto Probe-On glass slides (Fisher Scientific, PA, USA). The slides were stored at -20° C. until used. The slides were incubated in humidifed boxes at 42° C. for 18 h with 1×10⁶ cpm of the probe in a hybridization solution containing 50% tonnamide. 4×SSC (1×SSC=0.15 M NaCl, 0.015 M sodium citrate), 1×Denhardt (0.02% BSA, 0.02% Ficoll, 0.02% PVP), 1% sarkosyl, 0.02 M sodium phosphate (pH7.), 10% dextransulphate, 500 μg/ml salmon sperm DNA and 200 mnM DTT. Slides were subsequently rinsed in 1×SSC at 55° C. for 60 min with four changes of SSC and finally in 1×SSC starting at 55° C. and slowly cooled to room temperature, transferred through distilled water and briefly dehydrated in 50% and 95% ethanol for 30 sec each, air-dried, and covered with Amersham β-man autoradiography film for 15 to 30 days. Alternatively the slides were dipped in Kodak NTB2 nuclear track emulsion (diluted 1:1 with distilled water) and exposed for 30 to 60 days at 4° C. Finally, the sections were stained with cresyl violet.

Clear expression of clone 29 was observed in the reproductive tract of both male and female rats, while in all other rat tissues the expression was very low or below the level of detection with in-situ hybridisation (not shown). In male reproductive organs high expression was seen in the prostate gland (FIG. 3), while very low expression was observed in testis, epididymis and vesicula seminalis (not shown). In dipped sections, expression was clearly visible in prostate epithelial cells (secreting alveoli) while the expression in smooth muscle cells and fibroblasts in the stroma was low (FIG. 3). In female reproductive organs expression was seen in the ovary (FIG. 4), while uterus and vagina were negative (not shown). In dipped sections high expression was seen in the granulosa cell layer of primary, secondary and mature follicles (FIG. 4), whereas primordial follicles, oocytes and corpora lutea appeared completely negative. Low expression was seen in the interstitial cells of the ovary. Both anti-sense oligonucleotide probes used produced similar results. Addition of a 100 fold excess of the respective unlabelled oligonucleotide probes during the hybridisation reactions abolished all signals.

4. Transactivation Analysis in CHO-Cells

The expression vector pCMV29 was constructed by inserting the 2.6 kb clone 29 fragment in the EcoRI site of the expression vector pCMV5 (Andersson, S., Davis, D. L., Dahlback, H., Jornvall, H., & Russell, D. W. (1989) J. Biol. Chem. 264, 8222-8229). The pERE-ALP reporter construct contains a secreted form of the plancental alkaline phosphatase gene (Berger, J., Hauber, J., Hauber, R., Geiger, R., & Cullen, B. R. (1988) Gene 66, 1-10) and the MMTV-LTR in which the glucocorticoid response elements were replaced by the vitellogenin promoter estrogen response element (ERE).

CHO-K1 cells were seeded in 12-well plates at approximately 1.7×10⁵ cells per well in phenol-red free Ham F12 medium with 5% FCS (dextran-coated charcoal treated) and 2 mM Lglutamine. After 24 h the cells were transfected with 250 ng pERL-ALP vector and 50 ng pCMV29 using lipofectamine (Gibco) according to the manufacturer's instructions. After five hours of incubation the cells were washed and refed with 0.5 ml phenol-red free Coon's F-12 medium containing 5% serum substitute (SRC 3000, Tissue Culture Services Ltd., Botolph Claydon, Buckinghnam, UK) 2 mM Lglutamine and 50 μg/ml gentamicin plus hormones as indicated. After 48 h the medium was assayed for alkaline phosphatase (ALP) activity by a chemiluminescence assay. A 10 μl aliquot of the cell culture medium Was mixed with 200 μl assay buffer (10 mM diethanolamine pH 10.01mM MgCl₂ and 0.5 mM CSPD (Tropix Inc. Boston, USA) ) and incubated for 20 min at 37° C. before measurement in a microplate luminometer (Luminoskan; Labsystems, Finland) with integral measurement for 1 second. The ALP activity of ERE-reporter alone was set at 1.

5. Ligand Binding Characteristics and Transactivation Function of Clone 29 Protein

On the basis of the described high homology between clone 29 protein and rat ERα in the DBD and LBD it was hypothesized that clone 29 protein might encode a novel ER. Furthermore, biological effects of estrogens on rat prostate and ovary, which show high expression of clone 29 RNA, are well known (Griffiths, K., Davies, P., Eaton, C. I., Harper, M. E., Turkes, A., & Peeling W. B. (1991) in Endocrine Dependent Tumours, eds Voigt, K-D. & Knabbe, C. (Raven Press), pp 83-125, Richards, J. S (1994) Endocrine Rev. 15, 725-745; and Habenicht, U-F., Tunn, U. W., Senge, Th., Schroder, R. H., Schweikert, H. U., Bartsch, G., & E1 Etreby. M. F . (1993) J. Steroid Biochem. Molec. Biol. 44, 557-563). In order to analyze the steroid binding properties of clone 29 protein synthesized in vitro, the reticulocyte lysate was incubated at 8° C. for 16 h with increasing concentrations (0.3-6.0 nM) of ³ H!E2 in the presence or absence of a 200 fold molar excess of unlabelled E2. Linear transformation of saturation data revealed a single population of binding sites for E2 with a K_(d) (dissociation constant) of 0.6 nM (FIG. 5A and C). Steroid binding specificity was measured by incubating reticulocvte lysate with 5 nM ³ H!E2 in the presence of 0.5, 50, 500 and 5,000 nM unlabelled competitors. Competition curves generated are indicative of an estrogen receptor in that only estrogens competed efficiently with ³ H!E2 for binding (FIG. 5B). Fifty percent inhibition of specific binding occured by 0.6 fold excess of unlabelled E2; diethylstilbestrol, estriol, estrone and 5α-androstane-3β,17β-diol were 5, 15, 50 and 150 times, respectively, less effective as competitors. Neither testosterone, progesterone, corticosterone nor 5α- androstane-3α, 17β-diol were efficient competitors, even at the highest concentrations used (1000 fold excess). The dissociation constant and the steroid binding specificities measured are in good agreement with data previously reported for ERs in rat and human prostate, rat granulosa cells, rat antral follicles and whole rat ovarian tissue (Ekman, P., Barrack, E. R., Greene, G. L., Jensen, E. V., & Walsh. P. C (1983) J. Clin. Endocrinol. Metab. 57, 166-176; van Beurden-Lamers, W. M. O., Brinkmann, A. O., Mulder, E., & van der Molen, H. (1974) Biochem. J 140, 495-502; Kudolo, G. B., Elder, M. G., & Myatt, L. (1984) J. Endocrinol. 102, 83-91; and Kawashima, M., & GreenWald, G. S. (1993) Biology of Reprod. 48 172-179).

When clone 29 protein was labelled with a saturating dose of ³ H!E2 and analyzed on sucrose density gradients, a single peak of specifically bound radioactivity was observed. The sedimentation coefficient of this complex was about 7S, and it shifted to 4S in the presence of 0.4 M NaCl (not shown). To investigate the transcriptional regulatory properties of clone 29 protein, we performed co-transfection experiments in which CHO cells were transfected with a clone 29 protein expression vector and/or an estrogen-responsive reporter gene construct. Cells were incubated in the absence of E2 (clone 29) or in the presence of 100 nM E2 (Clone 29+F2) or in the presence of 100 nM E2 and 12 μM Tamoxifen (Clone 29+E2/Tam). In the absence of exogenously added E2 clone 29 protein showed considerable transcriptional activity which could be further increased by the addition of 100 nM E2 (FIG. 6). Simultaneous addition of a 10 fold excess of the antiestrogen Tamoxifen partially suppressed the E2 stimulated activity (FIG. 6). The constitutive transcriptional activity of clone 29 protein could be suppressed by the anti-estrogen ICI-1624384 (not shown). It has been shown previously that the wild-type mouse and human ERs are constitutive activators of transcription, and that the transcriptional activity can be stimulated further by the addition of E2 (Txukerman, M., Xiao-Kun Zhang., Hermann, T., Wills, K. N., Graupner, G., & Phal. M. (1990) New Biologist 2, 613-620 and Lees, J. A., Fawell, S. E., & Parker, M. G. (1989) Nucl. Acids Res. 17, 5477-5488). To obtain more insight into what concentrations of E2 effect clone 29 protein transcriptional activity, transient transfection experiments were carried out in the presence of increasing concentrations of E2. CHO-cells were transiently transfected with the ERE-reporter plasmid and the clone 29 protein expression plasmid. Cells were incubated with increasing concentrations of E2 (0.1-1000 nM), estrone (E1, 1000 nM). 5α-androstane-3β,17β-diol (3β-AD, 1000 nM) or no ligand added. Alkaline phosphatase activity (ALP) was measured as described and the activity in the absence of ligand (control) was set at 1. The figure shows relative ALP-activities (±SD) from three independent experiments. Clone 29 protein began to respond at 0.1 nM E2 and maximal stimulation was observed between 1 nm and 10 nM E2 (FIG. 7). The maximal stimulation factor was 2.6±0.5 fold (mean±SD. n=9) as compared to incubation in the absence of E2. Apart from E2 also estrone and 5α-androstane-3β,17β-diol could stimulate transcriptional activity, albeit at higher concentrations (FIG. 7). Dexamethasone, testosterone, progesterone, 5α-androstane-3α,17β-diol, thyroid hormone and all-trans-retinoic acid could not stimulate transcriptional activity of clone 29 protein, even at the highest concentration (1000 nM) tested (not shown). The results of the co-transfection experiments are in agreement with the ligand binding and specificity data of clone 29 protein presented in FIG. 5. In control experiments, wild-type human ERα also showed transcriptional activity in the absence of E2, which could be increased by the addition of E2 (not shown).

6. Detection of Rat ER Expression by RT-PCR

The tissue specificity of expression of rat ERβ and ERα was determined using reverse transcriptase polymerase chain reaction (RT-PCR). The results of the experiment are shown in FIG. 8.

B. Isolation of Human ERβ

1 A human version of ERβ (hERβ) has also been cloned from human ovary. The tissue specificity of hERβ expression in a variety of cells was also determined using the RT-PCR technique. The results are shown in FIG. 9. It will be noticed that there is a very high level of mRNA of hERβ in human umbilical vein endothelial cells (HUVEC) but no detection of hERα in the same cells. In addition, it will be seen that in human osteosarcoma cell line (HOS-D4), hERβ is expressed in greater quantities compared to hERα.

I. A human version of ERβ (hERβ) has also been cloned. The tissue specificity of hERβ expression in a variety of cells was also determined using the RT-PCR technique. The results are shown in FIG. 9. It will be noticed that there is a very high level of mRNA of hERβ in human umbilical vein endothelial cells (HUVEC) but no detection of hERα in the same cells. In addition, it will be seen that in human osteosarcoma cell line (HOS-D4), hERβ is expressed in greater quantities compared to hERα.

The partial DNA sequence of hERβ is shown in FIG. 13B (SEQ ID NO:4) and a derived amino acid sequence is shown in FIG. 13A (SEQ ID NO:3).

Cloning of Human ERβ from testis

A commercially available cDNA from human testis (Clontech, article no. HL1161x) was screened, using a fragment containing the ligand-binding domain of the rat ERβ cDNA as probe. Approximately 10⁶ recombinants were screened, resulting in one positive clone. Upon sequencing of this clone, it was seen that the insert was 1156 bp (FIGS. 13A and 13B). This corresponds to most of the translated region of a receptor with an overall homology of 90.0% to rat ERβ, therefore deduced to represent the human form of ERβ.

The cloned hERβ, however, lacks approximately 47 amino acids at the N-teninal end and 61 amino acids at the C- terminal end (as compared to the rat sequence). Further screening of the same library was unsuccessful. PCR technology was therefore used to obtain the remaining parts. For oligonucleotides were synthesised, two degenerate oligonucleotides containing all possible codons for the amino acids adjacent to the initiation methlionine and the stop codon, respectively, of the rat ERβ, and two specific oligonucleotides containing the sequence of the clone isolated from the human testis library and situated approximately 100 bp from respective end of this clone. PCR with the N-terminal and C-terminal pair of oligos yielded specific bands, that were subcloned and sequenced. The parts of these new clones that overlap the original cDNA clone are identical to this. It was thus possible to construct peptide and DNA sequences corresponding to the whole open reading frame (FIGS. 13A and 13B).

When comparing the human ERβ to rat ERβ, this receptor is 79.6% identical in the N-terminal domain, 98.5% in the DNA-binding domain, 85.6% in the hinge and 91.6% in the ligand-binding and F-domains. These numbers match very well those found when comparing the rat and human forms of ERα.

Studies of the expression of human ERβ using Northern blot show expression in testis and in ovaries. The expression in prostate, however, appears lower than found in the rat.

The human ERβ gene has been mapped to chromosome 14 using PCR and to region 14q22-23 using the FISH technique, whereas the human ERβ gene has been mapped to chromosome 6q25.

2. Comparison of Ligand Binding Affinity of hERα and rERβ

The ligand affinity of the two estrogen receptors, human ERα (ovary) (hERα) and rat ERβ (rERβ) was tested in binding saturation experiments and in binding competition experiments.

cDNA of the receptor subtypes hERα and rERβ were in vitro translated in rabbit reticulocyte lysate in presence of non-radioactive amino acids according to the instructions supplied by the manufacturer (Promega).

The radioactive ligand used in all experiments was 16α- ¹²⁵ I!-17β-estradiol ( ¹²⁵ I!-E2) (NEX-144, New England Nuclear). The method for the binding experiments was previously described in: Salomonsson M, Carlsson B, Haggblad J. J. Steroid Biochem. Molec. Biol. Vol. 50, No. 5/6 pp. 313-18, 1994. In brief, estrogen receptors are incubated with ¹²⁵ I!-E2 to equilibrium (16-18 h at +4° C.). The incubation was stopped by separation of protein-bound ¹²⁵ I!-E2 from free ¹²⁵ I!-E2 on Sephadex G25 columns. The radioactivity of the eluate is measured in a gamma-counter.

In the competition experiments, non-radioactive ligands were diluted in DMSO, mixed with ¹²⁵ I!-E2 (approximately 100-200 pM), aliquoted in parallel, and finally hERα or rERβ was added. The final concentration of DMSO in the binding buffer was 2%.

The buffer used in the experiments was of the following composition:

Hepes (pH=7.5) 20 mM, KC1 150 mM, EDTA 1 mM, glycerol (8.7%), monothioglycerol 6 mM, Na₃ MO₄ 10 mM.

3. Equilibrium Binding Saturation Experiments (K_(d) -Determinations)

A range of concentrations of ¹²⁵ I!-E2 were mixed with the ER:s and incubated as described above, free ¹²⁵ I!-E2 was determined by subtracting bound ¹²⁵ I!-E2 from added ¹²⁵ I!-E2. Binding data was analysed by Hill-plots and by Scatchard plots (FIG. 11). The equilibrium binding results are shown in Table 1. The apparent K_(d) -values for ¹²⁵ I!-E2 differed between the two ER:s with approximately a factor of four; K_(d) (hERα):K_(d) (rERβ)=1:4.

                  TABLE 1     ______________________________________     Equilibrium dissociation constants for  .sup.125 I!-E2     to the two subtypes.     Receptor subtype                   K.sub.d (Hill-plot)                             K.sub.d (Scatchard-plot)     ______________________________________     hERα    0.06 nM   0.09 nM     rERβ     0.24 nM   0.42 nM     ______________________________________

4. Competition Experiments (IC₅₀ Determinations)

The experiments were performed as described above. IC₅₀ values were obtained by applying a four parameter logistic analysis; b=((b_(max) -b_(min))/(1+(I/IC₅₀)^(S)))+b_(min), where I is the added concentration of binding inhibitor, IC₅₀ is the concentration of inhibitor at half maximal binding and S is a slope factor. The free concentration of ¹²⁵ I!-E2 was determined by sampling an aliquot from the wells at the end of the incubation and then substract bound radioactivity from sampled total radioactivity.

Since the equilibrium binding experiments (above) showed that the K_(d) -values for ¹²⁵ I!-E2 differed between the two ER:s, K_(i) -values (from the Cheng-Prusoff equation: K_(i) =IC₅₀ /(1+L/K_(d)) where L is free ( ¹²⁵¹ I!-E2!) were calculated for the compounds investigated. Two approaches for calculating RBA (Relative Binding Affinitv) were used. The RBA values were derived using either the IC₅₀ values or the K_(i) values. In both approaches, the value for the compound 16α-bromo-cstradiol was selected as the reference value (100%). Both approaches gave similar results. The results are summarized in FIGS. 11A and 11B. In these Figures "4-OH-Tam"=4-hydroxy-tamoxifen; "DES"=diethylstilbestrol; "Hexestr"=hexestrol: "ICI-164384"=ICI plc compound no. 164382; "17β-E2"=17β-estradiol; "16a-β- E2"=16α-bromo-estradiol; "Ralox"=Raloxifen; and "17a-E2"=17α diol.

The results show that ERα and ERβ have significant different ligand binding affinities--the apparent K_(d) -values for ¹²⁵ I!-E2 differed between the two ELR's by a factor of about 4 (K_(d) (hERα): K_(d) (rERβ) ≈1:4). Some compounds investigated showed significant differences in the competition for binding of ¹²⁵ I!-E2 to the ER's. Certain compounds were found to be more potent inhibitors of ¹²⁵ I!-E2 binding to hERα as compared to rERβ whereas others were found to be more potent inhibitors of ¹²⁵ I!-E2 binding to rERβ than to hERα.

    __________________________________________________________________________     #             SEQUENCE LISTING     - (1) GENERAL INFORMATION:     -    (iii) NUMBER OF SEQUENCES: 19     - (2) INFORMATION FOR SEQ ID NO:1:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 2568 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Rattus ra - #ttus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:     - GGAATTCCGG GGGAGCTGGC CCAGGGGGAG CGGCTGGTGC TGCCACTGGC AT - #CCCTAGGC       60     - ACCCAGGTCT GCAATAAAGT CTGGCAGCCA CTGCATGGCT GAGCGACAAC CA - #GTGGCTGG      120     - GAGTCCGGCT CTGTGGCTGA GGAAAGCACC TGTCTGCATT TAGAGAATGC AA - #AATAGAGA      180     - ATGTTTACCT GCCAGTCATT ACATCTGAGT CCCATGAGTC TCTGAGAACA TA - #ATGTCCAT      240     - CTGTACCTCT TCTCACAAGG AGTTTTCTCA GCTGCGACCC TCTGAAGACA TG - #GAGATCAA      300     - AAACTCACCG TCGAGCCTTA GTTCCCTGCT TCCTATAACT GTAGCCAGTC CA - #TCCTACCC      360     - CTGGAGCACG GCCCCATCTA CATCCCTTCC TCCTACGTAG ACAACCGCCA TG - #AGTATTCA      420     - GCTATGACAT TCTACAGTCC TGCTGTGATG AACTACAGTG TTCCCGGCAG CA - #CCAGTAAC      480     - CTGGACGGTG GGCCTGTCCG ACTGAGCACA AGCCCAAATG TGCTATGGCC AA - #CTTCTGGG      540     - CACCTGTCTC CTTTAGCGAC CCATTGCCAA TCATCGCTCC TCTATGCAGA AC - #CTCAAAAG      600     - AGTCCTTGGT GTGAAGCAAG ATCACTAGAG CACACCTTAC CTGTAAACAG AG - #AGACACTG      660     - AAGAGGAAGC TTAGTGGGAG CAGTTGTGCC AGCCCTGTTA CTAGTCCAAA CG - #CAAAGAGG      720     - GATGCTCACT TCTGCCCCGT CTGCAGCGAT TATGCATCTG GGTATCATTA CG - #GCGTTTGG      780     - TCATGTGAAG GATGTAAGGC CTTTTTTAAA AGAAGCATTC AAGGACATAA TG - #ATTATATC      840     - TGTCCAGCCA CGAATCAGTG TACCATAGAC AAGAACCGGC GTAAAAGCTG CC - #AGGCCTGC      900     - CGACTTCGCA AGTGTTATGA AGTAGGAATG GTCAAGTGTG GATCCAGGAG AG - #AACGGTGT      960     - GGGTACCGTA TAGTGCGGAG GCAGAGAAGT TCTAGCGAGC AGGTACACTG CC - #TGAGCAAA     1020     - GCCAAGAGAA ACGGTGGGCA TGCACCCCGG GTGAAGGAGC TACTGCTGAG CA - #CCTTGAGT     1080     - CCAGAGCAAC TGGTGCTCAC CCTCCTGGAA GCTGAACCAC CCAATGTGCT GG - #TGAGCCGT     1140     - CCCAGCATGC CCTTCACCGA GGCCTCCATG ATGATGTCCC TCACTAAGCT GG - #CGGACAAG     1200     - GAACTGGTGC ACATGATTGG CTGGGCCAAG AAAATCCCTG GCTTTGTGGA GC - #TCAGCCTG     1260     - TTGGACCAAG TCCGGCTCTT AGAAAGCTGC TGGATGGAGG TGCTAATGGT GG - #GACTGATG     1320     - TGGCGCTCCA TCGACCACCC CGGCAAGCTC ATTTTCGCTC CCGACCTCGT TC - #TGGACAGG     1380     - GATGAGGGGA AGTGCGTAGA AGGGATTCTG GAAATCTTTG ACATGCTCCT GG - #CGACGACG     1440     - TCAAGGTTCC GTGAGTTAAA ACTCCAGCAC AAGGAGTATC TCTGTGTGAA GG - #CCATGATC     1500     - CTCCTCAACT CCAGTATGTA CCCCTTGGCT TCTGCAAACC AGGAGGCAGA AA - #GTAGCCGG     1560     - AAGCTGACAC ACCTACTGAA CGCGGTGACA GATGCCCTGG TCTGGGTGAT TG - #CGAAGAGT     1620     - GGTATCTCCT CCCAGCAGCA GTCAGTCCGA CTGGCCAACC TCCTGATGCT TC - #TTTCTCAC     1680     - GTCAGGCACA TCAGTAACAA GGGCATGGAA CATCTGCTCA GCATGAAGTG CA - #AAAATGTG     1740     - GTCCCGGTGT ATGACCTGCT GCTGGAGATG CTGAATGCTC ACACGCTTCG AG - #GGTACAAG     1800     - TCCTCAATCT CGGGGTCTGA GTGCAGCTCA ACAGAGGACA GTAAGAACAA AG - #AGAGCTCC     1860     - CAGAACCTAC AGTCTCAGTG ATGGCCAGGC CTGAGGCGGA CAGACTACAG AG - #ATGGTCAA     1920     - AAGTGGAACA TGTACCCTAG CATCTGGGGG TTCCTCTTAG GGCTGCCTTG GT - #TACGCACC     1980     - CCTTACCCAC ACTGCACTTC CCAGGAGTCA GGGTGGTTGT GTGGCGGTGT TC - #CTCATACC     2040     - AGGATGTACC ACCGAATGCC AAGTTCTAAC TTGTATAGCC TTGAAGGCTC TC - #GGTGTACT     2100     - TACTTTCTGT CTCCTTGCCC ACTTGGAAAC ATCTGAAAGG TTCTGGAACT AA - #AGGTCAAA     2160     - GTCTGATTTG GAAGGATTGT CCTTAGTCAG GAAAAGGAAT ATGGCATGTG AC - #ACAGCTAT     2220     - AAGAAATGGA CTGTAGGACT GTGTGGCCAT AAAATCAACC TTTGGATGGC GT - #CTTCTAGA     2280     - CCACTTGATT GTAGGATTGA AAACCACATT GACAATCAGC TCATTTCGCA TT - #CCTGCCTC     2340     - ACGGGTCTGT GAGGACTCAT TAATGTCATG GGTTATTCTA TCAAAGACCA GA - #AAGATAGT     2400     - GCAAGCTTAG ATGTACCTTG TTCCTCCTCC CAGACCCTTG GGTTACATCC TT - #AGAGCCTG     2460     - CTTATTTGGT CTGTCTGAAT GTGGTCATTG TCATGGGTTA AGATTTAAAT CT - #CTTTGTAA     2520     #              2568GCTA TGTCATCTTT CTCTCTCTCC CGGAATTC     - (2) INFORMATION FOR SEQ ID NO:2:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 485 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Rattus ra - #ttus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:     - Met Thr Phe Tyr Ser Pro Ala Val Met Asn Ty - #r Ser Val Pro Gly Ser     #                15     - Thr Ser Asn Leu Asp Gly Gly Pro Val Arg Le - #u Ser Thr Ser Pro Asn     #            30     - Val Leu Trp Pro Thr Ser Gly His Leu Ser Pr - #o Leu Ala Thr His Cys     #        45     - Gln Ser Ser Leu Leu Tyr Ala Glu Pro Gln Ly - #s Ser Pro Trp Cys Glu     #    60     - Ala Arg Ser Leu Glu His Thr Leu Pro Val As - #n Arg Glu Thr Leu Lys     #80     - Arg Lys Leu Ser Gly Ser Ser Cys Ala Ser Pr - #o Val Thr Ser Pro Asn     #                95     - Ala Lys Arg Asp Ala His Phe Cys Pro Val Cy - #s Ser Asp Tyr Ala Ser     #           110     - Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gl - #y Cys Lys Ala Phe Phe     #       125     - Lys Arg Ser Ile Gln Gly His Asn Asp Tyr Il - #e Cys Pro Ala Thr Asn     #   140     - Gln Cys Thr Ile Asp Lys Asn Arg Arg Lys Se - #r Cys Gln Ala Cys Arg     145                 1 - #50                 1 - #55                 1 -     #60     - Leu Arg Lys Cys Tyr Glu Val Gly Met Val Ly - #s Cys Gly Ser Arg Arg     #               175     - Glu Arg Cys Gly Tyr Arg Ile Val Arg Arg Gl - #n Arg Ser Ser Ser Glu     #           190     - Gln Val His Cys Leu Ser Lys Ala Lys Arg As - #n Gly Gly His Ala Pro     #       205     - Arg Val Lys Glu Leu Leu Leu Ser Thr Leu Se - #r Pro Glu Gln Leu Val     #   220     - Leu Thr Leu Leu Glu Ala Glu Pro Pro Asn Va - #l Leu Val Ser Arg Pro     225                 2 - #30                 2 - #35                 2 -     #40     - Ser Met Pro Phe Thr Glu Ala Ser Met Met Me - #t Ser Leu Thr Lys Leu     #               255     - Ala Asp Lys Glu Leu Val His Met Ile Gly Tr - #p Ala Lys Lys Ile Pro     #           270     - Gly Phe Val Glu Leu Ser Leu Leu Asp Gln Va - #l Arg Leu Leu Glu Ser     #       285     - Cys Trp Met Glu Val Leu Met Val Gly Leu Me - #t Trp Arg Ser Ile Asp     #   300     - His Pro Gly Lys Leu Ile Phe Ala Pro Asp Le - #u Val Leu Asp Arg Asp     305                 3 - #10                 3 - #15                 3 -     #20     - Glu Gly Lys Cys Val Glu Gly Ile Leu Glu Il - #e Phe Asp Met Leu Leu     #               335     - Ala Thr Thr Ser Arg Phe Arg Glu Leu Lys Le - #u Gln His Lys Glu Tyr     #           350     - Leu Cys Val Lys Ala Met Ile Leu Leu Asn Se - #r Ser Met Tyr Pro Leu     #       365     - Ala Ser Ala Asn Gln Glu Ala Glu Ser Ser Ar - #g Lys Leu Thr His Leu     #   380     - Leu Asn Ala Val Thr Asp Ala Leu Val Trp Va - #l Ile Ala Lys Ser Gly     385                 3 - #90                 3 - #95                 4 -     #00     - Ile Ser Ser Gln Gln Gln Ser Val Arg Leu Al - #a Asn Leu Leu Met Leu     #               415     - Leu Ser His Val Arg His Ile Ser Asn Lys Gl - #y Met Glu His Leu Leu     #           430     - Ser Met Lys Cys Lys Asn Val Val Pro Val Ty - #r Asp Leu Leu Leu Glu     #       445     - Met Leu Asn Ala His Thr Leu Arg Gly Tyr Ly - #s Ser Ser Ile Ser Gly     #   460     - Ser Glu Cys Ser Ser Thr Glu Asp Ser Lys As - #n Lys Glu Ser Ser Gln     465                 4 - #70                 4 - #75                 4 -     #80     - Asn Leu Gln Ser Gln                     485     - (2) INFORMATION FOR SEQ ID NO:3:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 485 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Homo sapi - #ens     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:     - Met Thr Phe Tyr Ser Pro Ala Val Met Asn Ty - #r Ser Ile Pro Ser Asn     #                15     - Val Thr Asn Leu Glu Gly Gly Pro Gly Arg Gl - #n Thr Thr Ser Pro Asn     #            30     - Val Leu Trp Pro Thr Pro Gly His Leu Ser Pr - #o Leu Val Val His Arg     #        45     - Gln Leu Ser His Leu Tyr Ala Glu Pro Gln Ly - #s Ser Pro Trp Cys Glu     #    60     - Ala Arg Ser Leu Glu His Thr Leu Pro Val As - #n Arg Glu Thr Leu Lys     #80     - Arg Lys Val Ser Gly Asn Arg Cys Ala Ser Pr - #o Val Thr Gly Pro Gly     #                95     - Ser Lys Arg Asp Ala His Phe Cys Ala Val Cy - #s Ser Asp Tyr Ala Ser     #           110     - Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gl - #y Cys Lys Ala Phe Phe     #       125     - Lys Arg Ser Ile Gln Gly His Asn Asp Tyr Il - #e Cys Pro Ala Thr Asn     #   140     - Gln Cys Thr Ile Asp Lys Asn Arg Arg Lys Se - #r Cys Gln Ala Cys Arg     145                 1 - #50                 1 - #55                 1 -     #60     - Leu Arg Lys Cys Tyr Glu Val Gly Met Val Ly - #s Cys Gly Ser Arg Arg     #               175     - Glu Arg Cys Gly Tyr Arg Leu Val Arg Arg Gl - #n Arg Ser Ala Asp Glu     #           190     - Gln Leu His Cys Ala Gly Lys Ala Lys Arg Se - #r Gly Gly His Ala Pro     #       205     - Arg Val Arg Glu Leu Leu Leu Asp Ala Leu Se - #r Pro Glu Gln Leu Val     #   220     - Leu Thr Leu Leu Glu Ala Glu Pro Pro His Va - #l Leu Ile Ser Arg Pro     225                 2 - #30                 2 - #35                 2 -     #40     - Ser Ala Pro Phe Thr Glu Ala Ser Met Met Me - #t Ser Leu Thr Lys Leu     #               255     - Ala Asp Lys Glu Leu Val His Met Ile Ser Tr - #p Ala Lys Lys Ile Pro     #           270     - Gly Phe Val Glu Leu Ser Leu Phe Asp Gln Va - #l Arg Leu Leu Glu Ser     #       285     - Cys Trp Met Glu Val Leu Met Met Gly Leu Me - #t Trp Arg Ser Ile Asp     #   300     - His Pro Gly Lys Leu Ile Phe Ala Pro Asp Le - #u Val Leu Asp Arg Asp     305                 3 - #10                 3 - #15                 3 -     #20     - Glu Gly Lys Cys Val Glu Gly Ile Leu Glu Il - #e Phe Asp Met Leu Leu     #               335     - Ala Thr Thr Ser Arg Phe Arg Glu Leu Lys Le - #u Gln His Lys Glu Tyr     #           350     - Leu Cys Val Lys Ala Met Ile Leu Leu Asn Se - #r Ser Met Tyr Pro Leu     #       365     - Val Thr Ala Thr Gln Asp Ala Asp Ser Ser Ar - #g Lys Leu Ala His Leu     #   380     - Leu Asn Ala Val Thr Asp Ala Leu Val Trp Va - #l Ile Ala Lys Ser Gly     385                 3 - #90                 3 - #95                 4 -     #00     - Ile Ser Ser Gln Gln Gln Ser Met Arg Leu Al - #a Asn Leu Leu Met Leu     #               415     - Leu Ser His Val Arg His Ala Ser Asn Lys Gl - #y Met Glu His Leu Leu     #           430     - Asn Met Lys Cys Lys Asn Val Val Pro Val Ty - #r Asp Leu Leu Leu Glu     #       445     - Met Leu Asn Ala His Val Leu Arg Gly Cys Ly - #s Ser Ser Ile Thr Gly     #   460     - Ser Glu Cys Ser Pro Ala Glu Asp Ser Lys Se - #r Lys Glu Gly Ser Gln     465                 4 - #70                 4 - #75                 4 -     #80     - Asn Leu Gln Ser Gln                     485     - (2) INFORMATION FOR SEQ ID NO:4:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1460 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Homo sapi - #ens     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:     - CTATGACATT CTACAGTCCT GCTGTGATGA ATTACAGCAT TCCCAGCAAT GT - #CACTAACT       60     - TGGAAGGTGG GCCTGGTCGG CAGACCACAA GCCCAAATGT GTTGTGGCCA AC - #ACCTGGGC      120     - ACCTTTCTCC TTTAGTGGTC CATCGCCAGT TATCACATCT GTATGCGGAA CC - #TCAAAAGA      180     - GTCCCTGGTG TGAAGCAAGA TCGCTAGAAC ACACCTTACC TGTAAACAGA GA - #GACACTGA      240     - AAAGGAAGGT TAGTGGGAAC CGTTGCGCCA GCCCTGTTAC TGGTCCAGGT TC - #AAAGAGGG      300     - ATGCTCACTT CTGCGCTGTC TGCAGCGATT ACGCATCGGG ATATCACTAT GG - #AGTCTGGT      360     - CGTGTGAAGG ATGTAAGGCC TTTTTTAAAA GAAGCATTCA AGGACATAAT GA - #TTATATTT      420     - GTCCAGCTAC AAATCAGTGT ACAATCGATA AAAACCGGCG CAAGAGCTGC CA - #GGCCTGCC      480     - GACTTCGGAA GTGTTACGAA GTGGGAATGG TGAAGTGTGG CTCCCGGAGA GA - #GAGATGTG      540     - GGTACCGCCT TGTGCGGAGA CAGAGAAGTG CCGACGAGCA GCTGCACTGT GC - #CGGCAAGG      600     - CCAAGAGAAG TGGCGGCCAC GCGCCCCGAG TGCGGGAGCT GCTGCTGGAC GC - #CCTGAGCC      660     - CCGAGCAGCT AGTGCTCACC CTCCTGGAGG CTGAGCCGCC CCATGTGCTG AT - #CAGCCGCC      720     - CCAGTGCGCC CTTCACCGAG GCCTCCATGA TGATGTCCCT GACCAAGTTG GC - #CGACAAGG      780     - AGTTGGTACA CATGATCAGC TGGGCCAAGA AGATTCCCGG CTTTGTGGAG CT - #CAGCCTGT      840     - TCGACCAAGT GCGGCTCTTG GAGAGCTGTT GGATGGAGGT GTTAATGATG GG - #GCTGATGT      900     - GGCGCTCAAT TGACCACCCC GGCAAGCTCA TCTTTGCTCC AGATCTTGTT CT - #GGACAGGG      960     - ATGAGGGGAA ATGCGTAGAA GGAATTCTGG AAATCTTTGA CATGCTCCTG GC - #AACTACTT     1020     - CAAGGTTTCG AGAGTTAAAA CTCCAACACA AAGAATATCT CTGTGTCAAG GC - #CATGATCC     1080     - TGCTCAATTC CAGTATGTAC CCTCTGGTCA CAGCGACCCA GGATGCTGAC AG - #CAGCCGGA     1140     - AGCTGGCTCA CTTGCTGAAC GCCGTGACCG ATGCTTTGGT TTGGGTGATT GC - #CAAGAGCG     1200     - GCATCTCCTC CCAGCAGCAA TCCATGCGCC TGGCTAACCT CCTGATGCTC CT - #GTCCCACG     1260     - TCAGGCATGC GAGTAACAAG GGCATGGAAC ATCTGCTCAA CATGAAGTGC AA - #AAATGTGG     1320     - TCCCAGTGTA TGACCTGCTG CTGGAGATGC TGAATGCCCA CGTGCTTCGC GG - #GTGCAAGT     1380     - CCTCCATCAC GGGGTCCGAG TGCAGCCCGG CAGAGGACAG TAAAAGCAAA GA - #GGGCTCCC     1440     #                 146 - #0     - (2) INFORMATION FOR SEQ ID NO:5:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 485 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Mus muscu - #lus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:     - Met Ala Phe Tyr Ser Pro Ala Val Met Asn Ty - #r Ser Val Pro Ser Ser     #                15     - Thr Gly Asn Leu Glu Gly Gly Pro Val Arg Gl - #n Thr Ala Ser Pro Asn     #            30     - Val Leu Trp Pro Thr Ser Gly His Leu Ser Pr - #o Leu Ala Thr His Cys     #        45     - Gln Ser Ser Leu Leu Tyr Ala Glu Pro Gln Ly - #s Ser Pro Trp Cys Glu     #    60     - Ala Arg Ser Leu Glu His Thr Leu Pro Val As - #n Arg Glu Thr Leu Lys     #80     - Arg Lys Leu Gly Gly Ser Gly Cys Ala Ser Pr - #o Val Thr Ser Pro Ser     #                95     - Thr Lys Arg Asp Ala His Phe Cys Ala Val Cy - #s Ser Asp Tyr Ala Ser     #           110     - Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gl - #y Cys Lys Ala Phe Phe     #       125     - Lys Arg Ser Ile Gln Gly His Asn Asp Tyr Il - #e Cys Pro Ala Thr Asn     #   140     - Gln Cys Thr Ile Asp Lys Asn Arg Arg Lys As - #n Cys Gln Ala Cys Arg     145                 1 - #50                 1 - #55                 1 -     #60     - Leu Arg Lys Cys Tyr Glu Val Gly Met Val Ly - #s Cys Gly Ser Arg Arg     #               175     - Glu Arg Cys Gly Tyr Arg Ile Val Arg Arg Gl - #n Arg Ser Ala Ser Glu     #           190     - Gln Val His Cys Leu Asn Lys Ala Lys Arg Th - #r Ser Gly His Thr Pro     #       205     - Arg Val Lys Glu Leu Leu Leu Asn Ser Leu Se - #r Pro Glu Gln Leu Val     #   220     - Leu Thr Leu Leu Glu Ala Glu Pro Pro Asn Va - #l Leu Val Ser Arg Pro     225                 2 - #30                 2 - #35                 2 -     #40     - Ser Met Pro Phe Thr Glu Ala Ser Met Met Me - #t Ser Leu Thr Lys Leu     #               255     - Ala Asp Lys Glu Leu Val His Met Ile Gly Tr - #p Ala Lys Lys Ile Pro     #           270     - Gly Phe Val Glu Leu Ser Leu Leu Asp Gln Va - #l Arg Leu Leu Glu Ser     #       285     - Cys Trp Met Glu Val Leu Met Val Gly Leu Me - #t Trp Arg Ser Ile Asp     #   300     - His Pro Gly Lys Leu Ile Phe Ala Pro Asp Le - #u Val Leu Asp Arg Asp     305                 3 - #10                 3 - #15                 3 -     #20     - Glu Gly Lys Cys Val Glu Gly Ile Leu Glu Il - #e Phe Asp Met Leu Leu     #               335     - Ala Thr Thr Ala Arg Phe Arg Glu Leu Lys Le - #u Gln His Lys Glu Tyr     #           350     - Leu Cys Val Lys Ala Met Ile Leu Leu Asn Se - #r Ser Met Tyr His Leu     #       365     - Ala Thr Ala Ser Gln Glu Ala Glu Ser Ser Ar - #g Lys Leu Thr His Leu     #   380     - Leu Asn Ala Val Thr Asp Ala Leu Val Trp Va - #l Ile Ser Lys Ser Arg     385                 3 - #90                 3 - #95                 4 -     #00     - Ile Ser Ser Gln Gln Gln Ser Val Arg Leu Al - #a Asn Leu Leu Met Leu     #               415     - Leu Ser His Val Arg His Ile Ser Asn Lys Gl - #y Met Glu His Leu Leu     #           430     - Ser Met Lys Cys Lys Asn Val Val Pro Val Ty - #r Asp Leu Leu Leu Glu     #       445     - Met Leu Asn Ala His Thr Leu Arg Gly Tyr Ly - #s Ser Ser Ile Ser Gly     #   460     - Ser Gly Cys Cys Ser Thr Glu Asp Ser Lys Se - #r Lys Glu Gly Ser Gln     465                 4 - #70                 4 - #75                 4 -     #80     - Asn Leu Gln Ser Gln                     485     - (2) INFORMATION FOR SEQ ID NO:6:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1458 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Mus muscu - #lus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:     - ATGGCATTCT ACAGTCCTGC TGTGATGAAC TACAGTGTTC CCAGCAGCAC CG - #GTAACCTG       60     - GAAGGTGGGC CTGTTCGCCA GACTGCAAGC CCAAATGTGC TATGGCCAAC TT - #CTGGACAC      120     - CTCTCTCCTT TAGCCACCCA CTGCCAATCA TCGCTTCTCT ATGCAGAACC TC - #AAAAGAGT      180     - CCTTGGTGTG AAGCAAGATC ACTAGAACAC ACCTTGCCTG TAAACAGAGA GA - #CCCTGAAG      240     - AGGAAGCTTG GCGGGAGCGG TTGTGCCAGC CCTGTTACTA GTCCAAGCAC CA - #AGAGGGAT      300     - GCTCACTTCT GTGCCGTCTG CAGTGATTAT GCATCTGGGT ATCATTACGG TG - #TCTGGTCC      360     - TGTGAAGGAT GTAAGGCCTT TTTTAAAAGA AGCATTCAAG GACATAATGA CT - #ATATCTGT      420     - CCAGCCACGA ATCAGTGTAC GATAGACAAG AACCGGCGTA AAAACTGCCA GG - #CCTGCCGA      480     - CTTCGCAAGT GTTACGAAGT AGGAATGGTC AAGTGTGGAT CCAGGAGAGA AA - #GGTGTGGG      540     - TACCGAATAG TACGAAGACA GAGAAGTGCC AGCGAGCAGG TGCATTGCCT GA - #ACAAAGCC      600     - AAGAGAACCA GTGGGCACAC ACCCCGGGTG AAGGAGCTAC TGCTGAACTC TC - #TGAGTCCC      660     - GAGCAGCTGG TGCTCACCCT GCTGGAAGCT GAGCCACCCA ATGTGCTAGT GA - #GTCGTCCC      720     - AGCATGCCCT TCACCGAGGC CTCCATGATG ATGTCCCTTA CGAAGCTGGC TG - #ACAAGGAA      780     - CTGGTGCACA TGATTGGCTG GGCCAAGAAA ATCCCTGGCT TTGTGGAGCT CA - #GCCTGTTG      840     - GACCAAGTCC GCCTCTTGGA AAGCTGCTGG ATGGAGGTGC TGATGGTGGG GC - #TGATGTGG      900     - CGCTCCATCG ACCACCCCGG CAAGCTCATC TTTGCTCCAG ACCTCGTTCT GG - #ACAGGGAT      960     - GAGGGGAAGT GCGTGGAAGG GATTCTGGAA ATCTTTGACA TGCTCCTGGC GA - #CGACGGCA     1020     - CGGTTCCGTG AGTTAAAACT GCAGCACAAA GAATATCTGT GTGTGAAGGC CA - #TGATTCTC     1080     - CTCAACTCCA GTATGTACCA CTTGGCTACC GCAAGCCAGG AAGCAGAGAG TA - #GCCGGAAG     1140     - CTGACACACC TATTGAACGC AGTGACAGAT GCCCTGGTCT GGGTGATTTC GA - #AGAGTAGA     1200     - ATCTCTTCCC AGCAGCAGTC AGTCCGTCTG GCCAACCTCC TGATGCTTCT TT - #CTCATGTC     1260     - AGGCACATCA GTAACAAGGG CATGGAACAT CTGCTCAGCA TGAAGTGCAA AA - #ATGTGGTC     1320     - CCGGTGTACG ACCTGCTGCT GGAGATGCTG AATGCTCACA CGCTTCGAGG GT - #ACAAGTCC     1380     - TCAATCTCGG GGTCTGGGTG CTGCTCGACA GAGGACAGTA AGAGCAAAGA GG - #GCTCCCAG     1440     #1458              GA     - (2) INFORMATION FOR SEQ ID NO:7:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 226 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Rattus ra - #ttus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:     - Glu Leu Val His Met Ile Gly Trp Ala Lys Ly - #s Ile Pro Gly Phe Val     #                15     - Glu Leu Ser Leu Leu Asp Gln Val Arg Leu Le - #u Glu Ser Cys Trp Met     #            30     - Glu Val Leu Met Val Gly Leu Met Trp Arg Se - #r Ile Asp His Pro Gly     #        45     - Lys Leu Ile Phe Ala Pro Asp Leu Val Leu As - #p Arg Asp Glu Gly Lys     #    60     - Cys Val Glu Gly Ile Leu Glu Ile Phe Asp Me - #t Leu Leu Ala Thr Thr     #80     - Ser Arg Phe Arg Glu Leu Lys Leu Gln His Ly - #s Glu Tyr Leu Cys Val     #                95     - Lys Ala Met Ile Leu Leu Asn Ser Ser Met Ty - #r Pro Leu Ala Ser Ala     #           110     - Asn Gln Glu Ala Glu Ser Ser Arg Lys Leu Th - #r His Leu Leu Asn Ala     #       125     - Val Thr Asp Ala Leu Val Trp Val Ile Ala Ly - #s Ser Gly Ile Ser Ser     #   140     - Gln Gln Gln Ser Val Arg Leu Ala Asn Leu Le - #u Met Leu Leu Ser His     145                 1 - #50                 1 - #55                 1 -     #60     - Val Arg His Ile Ser Asn Lys Gly Met Glu Hi - #s Leu Leu Ser Met Lys     #               175     - Cys Lys Asn Val Val Pro Val Tyr Asp Leu Le - #u Leu Glu Met Leu Asn     #           190     - Ala His Thr Leu Arg Gly Tyr Lys Ser Ser Il - #e Ser Gly Ser Glu Cys     #       205     - Ser Ser Thr Glu Asp Ser Lys Asn Lys Glu Se - #r Ser Gln Asn Leu Gln     #   220     - Ser Gln     225     - (2) INFORMATION FOR SEQ ID NO:8:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 243 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Rattus ra - #ttus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:     - Glu Leu Val His Met Ile Asn Trp Ala Lys Ar - #g Val Pro Gly Phe Gly     #                15     - Asp Leu Asn Leu His Asp Gln Val His Leu Le - #u Glu Cys Ala Trp Leu     #            30     - Glu Ile Leu Met Ile Gly Leu Val Trp Arg Se - #r Met Glu His Pro Gly     #        45     - Lys Leu Leu Phe Ala Pro Asn Leu Leu Leu As - #p Arg Asn Gln Gly Lys     #    60     - Cys Val Glu Gly Met Val Glu Ile Phe Asp Me - #t Leu Leu Ala Thr Ser     #80     - Ser Arg Phe Arg Met Met Asn Leu Gln Gly Gl - #u Glu Phe Val Cys Leu     #                95     - Lys Ser Ile Ile Leu Leu Asn Ser Gly Val Ty - #r Thr Phe Leu Ser Ser     #           110     - Thr Leu Lys Ser Leu Glu Glu Lys Asp His Il - #e His Arg Val Leu Asp     #       125     - Lys Ile Asn Asp Thr Leu Ile His Leu Met Al - #a Lys Ala Gly Leu Thr     #   140     - Leu Gln Gln Gln His Arg Arg Leu Ala Gln Le - #u Leu Leu Ile Leu Ser     145                 1 - #50                 1 - #55                 1 -     #60     - His Ile Arg His Met Ser Asn Lys Gly Met Gl - #u His Leu Tyr Asn Met     #               175     - Lys Cys Lys Asn Val Val Pro Leu Tyr Asp Le - #u Leu Leu Glu Met Leu     #           190     - Asp Ala His Arg Leu His Ala Pro Ala Ser Ar - #g Met Gly Val Pro Pro     #       205     - Glu Glu Pro Ser Gln Ser Gln Leu Thr Thr Th - #r Ser Ser Thr Ser Ala     #   220     - His Ser Leu Gln Thr Tyr Tyr Ile Pro Pro Gl - #u Ala Glu Gly Phe Pro     225                 2 - #30                 2 - #35                 2 -     #40     - Asn Thr Ile     - (2) INFORMATION FOR SEQ ID NO:9:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 243 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Mus muscu - #lus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:     - Glu Leu Val His Met Ile Asn Trp Ala Lys Ar - #g Val Pro Gly Phe Gly     #                15     - Asp Leu Asn Leu His Asp Gln Val His Leu Le - #u Glu Cys Ala Trp Leu     #            30     - Glu Ile Leu Met Ile Gly Leu Val Trp Arg Se - #r Met Glu His Pro Gly     #        45     - Lys Leu Leu Phe Ala Pro Asn Leu Leu Leu As - #p Arg Asn Gln Gly Lys     #    60     - Cys Val Glu Gly Met Val Glu Ile Phe Asp Me - #t Leu Leu Ala Thr Ser     #80     - Ser Arg Phe Arg Met Met Asn Leu Gln Gly Gl - #u Glu Phe Val Cys Leu     #                95     - Lys Ser Ile Ile Leu Leu Asn Ser Gly Val Ty - #r Thr Phe Leu Ser Ser     #           110     - Thr Leu Lys Ser Leu Glu Glu Lys Asp His Il - #e His Arg Val Leu Asp     #       125     - Lys Ile Thr Asp Thr Leu Ile His Leu Met Al - #a Lys Ala Gly Leu Thr     #   140     - Leu Gln Gln Gln His Arg Arg Leu Ala Gln Le - #u Leu Leu Ile Leu Ser     145                 1 - #50                 1 - #55                 1 -     #60     - His Ile Arg His Met Ser Asn Lys Gly Met Gl - #u His Leu Tyr Asn Met     #               175     - Lys Cys Lys Asn Val Val Pro Leu Tyr Asp Le - #u Leu Leu Glu Met Leu     #           190     - Asp Ala His Arg Leu His Ala Pro Ala Ser Ar - #g Met Gly Val Pro Pro     #       205     - Glu Glu Pro Ser Gln Thr Gln Leu Ala Thr Th - #r Ser Ser Thr Ser Ala     #   220     - His Ser Leu Gln Thr Tyr Tyr Ile Pro Pro Gl - #u Ala Glu Gly Phe Pro     225                 2 - #30                 2 - #35                 2 -     #40     - Asn Thr Ile     - (2) INFORMATION FOR SEQ ID NO:10:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 243 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Homo sapi - #ens     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:     - Glu Leu Val His Met Ile Asn Trp Ala Lys Ar - #g Val Pro Gly Phe Val     #                15     - Asp Leu Thr Leu His Asp Gln Val His Leu Le - #u Glu Cys Ala Trp Leu     #            30     - Glu Ile Leu Met Ile Gly Leu Val Trp Arg Se - #r Met Glu His Pro Val     #        45     - Lys Leu Leu Phe Ala Pro Asn Leu Leu Leu As - #p Arg Asn Gln Gly Lys     #    60     - Cys Val Glu Gly Met Val Glu Ile Phe Asp Me - #t Leu Leu Ala Thr Ser     #80     - Ser Arg Phe Arg Met Met Asn Leu Gln Gly Gl - #u Glu Phe Val Cys Leu     #                95     - Lys Ser Ile Ile Leu Leu Asn Ser Gly Val Ty - #r Thr Phe Leu Ser Ser     #           110     - Thr Leu Lys Ser Leu Glu Glu Lys Asp His Il - #e His Arg Val Leu Asp     #       125     - Lys Ile Thr Asp Thr Leu Ile His Leu Met Al - #a Lys Ala Gly Leu Thr     #   140     - Leu Gln Gln Gln His Gln Arg Leu Ala Gln Le - #u Leu Leu Ile Leu Ser     145                 1 - #50                 1 - #55                 1 -     #60     - His Ile Arg His Met Ser Asn Lys Gly Met Gl - #u His Leu Tyr Ser Met     #               175     - Lys Cys Lys Asn Val Val Pro Leu Tyr Asp Le - #u Leu Leu Glu Met Leu     #           190     - Asp Ala His Arg Leu His Ala Pro Thr Ser Ar - #g Gly Gly Ala Ser Val     #       205     - Glu Glu Thr Asp Gln Ser His Leu Ala Thr Al - #a Gly Ser Thr Ser Ser     #   220     - His Ser Leu Gln Lys Tyr Tyr Ile Thr Gly Gl - #u Ala Glu Gly Phe Pro     225                 2 - #30                 2 - #35                 2 -     #40     - Ala Thr Val     - (2) INFORMATION FOR SEQ ID NO:11:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 66 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Rattus ra - #ttus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:     - Cys Pro Val Cys Ser Asp Tyr Ala Ser Gly Ty - #r His Tyr Gly Val Trp     #                15     - Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Ar - #g Ser Ile Gln Gly His     #            30     - Asn Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cy - #s Thr Ile Asp Lys Asn     #        45     - Arg Arg Lys Ser Cys Gln Ala Cys Arg Leu Ar - #g Lys Cys Tyr Glu Val     #    60     - Gly Met     65     - (2) INFORMATION FOR SEQ ID NO:12:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 66 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:     - Cys Ala Val Cys Asn Asp Tyr Ala Ser Gly Ty - #r His Tyr Gly Val Trp     #                15     - Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Ar - #g Ser Ile Gln Gly His     #            30     - Asn Asp Tyr Met Cys Pro Ala Thr Asn Gln Cy - #s Thr Ile Asp Lys Asn     #        45     - Arg Arg Lys Ser Cys Gln Ala Cys Arg Leu Ar - #g Lys Cys Tyr Glu Val     #    60     - Gly Met     65     - (2) INFORMATION FOR SEQ ID NO:13:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 484 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Rattus ra - #ttus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:     - Met Thr Phe Tyr Ser Pro Ala Val Met Asn Ty - #r Ser Val Pro Gly Ser     #                15     - Thr Ser Asn Leu Asp Gly Gly Pro Val Arg Le - #u Ser Thr Ser Pro Asn     #            30     - Val Leu Trp Pro Thr Ser Gly His Leu Ser Pr - #o Leu Ala Thr His Cys     #        45     - Gln Ser Ser Leu Leu Tyr Ala Glu Pro Gln Ly - #s Ser Pro Trp Cys Glu     #    60     - Ala Arg Ser Leu Glu His Thr Leu Pro Val As - #n Arg Glu Thr Leu Lys     #80     - Arg Lys Leu Ser Gly Ser Ser Cys Ala Ser Pr - #o Val Thr Ser Pro Asn     #                95     - Ala Lys Arg Asp Ala His Phe Cys Pro Val Cy - #s Ser Asp Tyr Ala Ser     #           110     - Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gl - #y Cys Lys Ala Phe Phe     #       125     - Lys Arg Ser Ile Gln Gly His Asn Asp Tyr Il - #e Cys Pro Ala Thr Asn     #   140     - Gln Cys Thr Ile Asp Lys Asn Arg Arg Lys Se - #r Cys Gln Ala Cys Arg     145                 1 - #50                 1 - #55                 1 -     #60     - Leu Arg Lys Cys Tyr Glu Val Gly Met Val Ly - #s Cys Gly Ser Arg Arg     #               175     - Glu Arg Cys Gly Tyr Arg Ile Val Arg Arg Gl - #n Arg Ser Ser Ser Glu     #           190     - Gln Val His Cys Leu Ser Lys Ala Lys Arg As - #n Gly Gly His Ala Pro     #       205     - Arg Val Lys Glu Leu Leu Leu Ser Thr Leu Se - #r Pro Glu Gln Leu Val     #   220     - Leu Thr Leu Leu Glu Ala Glu Pro Pro Asn Va - #l Leu Val Ser Arg Pro     225                 2 - #30                 2 - #35                 2 -     #40     - Ser Met Pro Phe Thr Glu Ala Ser Met Met Me - #t Ser Leu Thr Lys Leu     #               255     - Ala Asp Lys Glu Leu Val His Met Ile Gly Tr - #p Ala Lys Lys Ile Pro     #           270     - Gly Phe Val Glu Leu Ser Leu Leu Asp Gln Va - #l Arg Leu Leu Glu Ser     #       285     - Cys Trp Met Glu Val Leu Met Val Gly Leu Me - #t Trp Arg Ser Ile Asp     #   300     - His Pro Gly Lys Leu Ile Phe Ala Pro Asp Le - #u Val Leu Asp Arg Asp     305                 3 - #10                 3 - #15                 3 -     #20     - Glu Gly Lys Cys Val Glu Gly Ile Leu Glu Il - #e Phe Asp Met Leu Leu     #               335     - Ala Thr Thr Ser Arg Phe Arg Glu Leu Lys Le - #u Gln His Lys Glu Tyr     #           350     - Leu Cys Val Lys Ala Met Ile Leu Leu Asn Se - #r Ser Met Tyr Pro Leu     #       365     - Ala Ser Ala Asn Gln Glu Ala Glu Ser Ser Ar - #g Lys Leu Thr His Leu     #   380     - Leu Asn Ala Val Thr Asp Ala Leu Val Trp Va - #l Ile Ala Lys Ser Gly     385                 3 - #90                 3 - #95                 4 -     #00     - Ile Ser Ser Gln Gln Gln Ser Val Arg Leu Al - #a Asn Leu Leu Met Leu     #               415     - Leu Ser His Val Arg His Ile Ser Asn Lys Gl - #y Met Glu His Leu Leu     #           430     - Ser Met Lys Cys Lys Asn Val Val Pro Val Ty - #r Asp Leu Leu Leu Glu     #       445     - Met Leu Asn Ala His Thr Leu Arg Gly Tyr Ly - #s Ser Ser Ile Ser Gly     #   460     - Ser Glu Cys Ser Ser Thr Glu Asp Ser Lys As - #n Lys Glu Ser Ser Gln     465                 4 - #70                 4 - #75                 4 -     #80     - Asn Leu Gln Ser     - (2) INFORMATION FOR SEQ ID NO:14:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 484 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Mus muscu - #lus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:     - Met Ala Phe Tyr Ser Pro Ala Val Met Asn Ty - #r Ser Val Pro Ser Ser     #                15     - Thr Gly Asn Leu Glu Gly Gly Pro Val Arg Gl - #n Thr Ala Ser Pro Asn     #            30     - Val Leu Trp Pro Thr Ser Gly His Leu Ser Pr - #o Leu Ala Thr His Cys     #        45     - Gln Ser Ser Leu Leu Tyr Ala Glu Pro Gln Ly - #s Ser Pro Trp Cys Glu     #    60     - Ala Arg Ser Leu Glu His Thr Leu Pro Val As - #n Arg Glu Thr Leu Lys     #80     - Arg Lys Leu Gly Gly Ser Gly Cys Ala Ser Pr - #o Val Thr Ser Pro Ser     #                95     - Thr Lys Arg Asp Ala His Phe Cys Ala Val Cy - #s Ser Asp Tyr Ala Ser     #           110     - Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gl - #y Cys Lys Ala Phe Phe     #       125     - Lys Arg Ser Ile Gln Gly His Asn Asp Tyr Il - #e Cys Pro Ala Thr Asn     #   140     - Gln Cys Thr Ile Asp Lys Asn Arg Arg Lys As - #n Cys Gln Ala Cys Arg     145                 1 - #50                 1 - #55                 1 -     #60     - Leu Arg Lys Cys Tyr Glu Val Gly Met Val Ly - #s Cys Gly Ser Arg Arg     #               175     - Glu Arg Cys Gly Tyr Arg Ile Val Arg Arg Gl - #n Arg Ser Ala Ser Glu     #           190     - Gln Val His Cys Leu Asn Lys Ala Lys Arg Th - #r Ser Gly His Thr Pro     #       205     - Arg Val Lys Glu Leu Leu Leu Asn Ser Leu Se - #r Pro Glu Gln Leu Val     #   220     - Leu Thr Leu Leu Glu Ala Glu Pro Pro Asn Va - #l Leu Val Ser Arg Pro     225                 2 - #30                 2 - #35                 2 -     #40     - Ser Met Pro Phe Thr Glu Ala Ser Met Met Me - #t Ser Leu Thr Lys Leu     #               255     - Ala Asp Lys Glu Leu Val His Met Ile Gly Tr - #p Ala Lys Lys Ile Pro     #           270     - Gly Phe Val Glu Leu Ser Leu Leu Asp Gln Va - #l Arg Leu Leu Glu Ser     #       285     - Cys Trp Met Glu Val Leu Met Val Gly Leu Me - #t Trp Arg Ser Ile Asp     #   300     - His Pro Gly Lys Leu Ile Phe Ala Pro Asp Le - #u Val Leu Asp Arg Asp     305                 3 - #10                 3 - #15                 3 -     #20     - Glu Gly Lys Cys Val Glu Gly Ile Leu Glu Il - #e Phe Asp Met Leu Leu     #               335     - Ala Thr Thr Ala Arg Phe Arg Glu Leu Lys Le - #u Gln His Lys Glu Tyr     #           350     - Leu Cys Val Lys Ala Met Ile Leu Leu Asn Se - #r Ser Met Tyr His Leu     #       365     - Ala Thr Ala Ser Gln Glu Ala Glu Ser Ser Ar - #g Lys Leu Thr His Leu     #   380     - Leu Asn Ala Val Thr Asp Ala Leu Val Trp Va - #l Ile Ser Lys Ser Arg     385                 3 - #90                 3 - #95                 4 -     #00     - Ile Ser Ser Gln Gln Gln Ser Val Arg Leu Al - #a Asn Leu Leu Met Leu     #               415     - Leu Ser His Val Arg His Ile Ser Asn Lys Gl - #y Met Glu His Leu Leu     #           430     - Ser Met Lys Cys Lys Asn Val Val Pro Val Ty - #r Asp Leu Leu Leu Glu     #       445     - Met Leu Asn Ala His Thr Leu Arg Gly Tyr Ly - #s Ser Ser Ile Ser Gly     #   460     - Ser Gly Cys Cys Ser Thr Glu Asp Ser Lys Se - #r Lys Glu Gly Ser Gln     465                 4 - #70                 4 - #75                 4 -     #80     - Asn Leu Gln Ser     - (2) INFORMATION FOR SEQ ID NO:15:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 384 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Homo sapi - #ens     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:     - Ala Leu Ser Pro Leu Val Val His Arg Gln Le - #u Ser His Leu Tyr Ala     #                15     - Glu Pro Gln Lys Ser Pro Trp Cys Glu Ala Ar - #g Ser Leu Glu His Thr     #            30     - Leu Pro Val Asn Arg Glu Thr Leu Lys Arg Ly - #s Val Ser Gly Asn Arg     #        45     - Cys Ala Ser Pro Val Thr Gly Pro Gly Ser Ly - #s Arg Asp Ala His Phe     #    60     - Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Ty - #r His Tyr Gly Val Trp     #80     - Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Ar - #g Ser Ile Gln Gly His     #                95     - Asn Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cy - #s Thr Ile Asp Lys Asn     #           110     - Arg Arg Lys Ser Cys Gln Ala Cys Arg Leu Ar - #g Lys Cys Tyr Glu Val     #       125     - Gly Met Val Lys Cys Gly Ser Arg Arg Glu Ar - #g Cys Gly Tyr Arg Leu     #   140     - Val Arg Arg Gln Arg Ser Ala Asp Glu Gln Le - #u His Cys Ala Gly Lys     145                 1 - #50                 1 - #55                 1 -     #60     - Ala Lys Arg Ser Gly Gly His Ala Pro Arg Va - #l Arg Glu Leu Leu Leu     #               175     - Asp Ala Leu Ser Pro Glu Gln Leu Val Leu Th - #r Leu Leu Glu Ala Glu     #           190     - Pro Pro His Val Leu Ile Ser Arg Pro Ser Al - #a Pro Phe Thr Glu Ala     #       205     - Ser Met Met Met Ser Leu Thr Lys Leu Ala As - #p Lys Glu Leu Val His     #   220     - Met Ile Ser Trp Ala Lys Lys Ile Pro Gly Ph - #e Val Glu Leu Ser Leu     225                 2 - #30                 2 - #35                 2 -     #40     - Phe Asp Gln Val Arg Leu Leu Glu Ser Cys Tr - #p Met Glu Val Leu Met     #               255     - Met Gly Leu Met Trp Arg Ser Ile Asp His Pr - #o Gly Lys Leu Ile Phe     #           270     - Ala Pro Asp Leu Val Leu Asp Arg Asp Glu Gl - #y Lys Cys Val Glu Gly     #       285     - Ile Leu Glu Ile Phe Asp Met Leu Leu Ala Th - #r Thr Ser Arg Phe Arg     #   300     - Glu Leu Lys Leu Gln His Lys Glu Tyr Leu Cy - #s Val Lys Ala Met Ile     305                 3 - #10                 3 - #15                 3 -     #20     - Leu Leu Asn Ser Ser Met Tyr Pro Leu Val Th - #r Ala Thr Gln Asp Ala     #               335     - Asp Ser Ser Arg Lys Leu Ala His Leu Leu As - #n Ala Val Thr Asp Ala     #           350     - Leu Val Trp Val Ile Ala Lys Ser Gly Ile Se - #r Ser Gln Gln Gln Ser     #       365     - Met Arg Leu Ala Asn Leu Leu Met Leu Leu Se - #r His Val Arg His Ala     #   380     - (2) INFORMATION FOR SEQ ID NO:16:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 596 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Rattus ra - #ttus     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:     - Met Thr Met Thr Leu His Thr Lys Ala Ser Gl - #y Met Ala Leu Leu His     #                15     - Gln Ile Gln Gly Asn Glu Leu Glu Pro Leu As - #n Arg Pro Gln Leu Lys     #            30     - Met Pro Met Glu Arg Ala Leu Gly Glu Val Ty - #r Val Asp Asn Ser Lys     #        45     - Pro Ala Val Phe Asn Tyr Pro Glu Gly Ala Al - #a Tyr Glu Phe Asn Ala     #    60     - Ala Ala Ala Ala Ala Ala Ala Gly Ala Ser Al - #a Pro Val Tyr Gly Gln     #80     - Ser Ser Ile Thr Tyr Gly Pro Gly Ser Glu Al - #a Ala Ala Phe Gly Ala     #                95     - Asn Ser Leu Gly Ala Phe Pro Gln Leu Asn Se - #r Val Ser Pro Ser Pro     #           110     - Ile Met Ile Leu His Pro Pro Pro His Val Se - #r Pro Phe Leu His Pro     #       125     - His Gly His Gln Val Pro Tyr Tyr Leu Glu As - #n Glu Pro Ser Ala Tyr     #   140     - Ala Val Arg Asp Thr Gly Pro Pro Ala Phe Ty - #r Arg Ser Asn Ser Asp     145                 1 - #50                 1 - #55                 1 -     #60     - Asn Arg Arg Gln Asn Gly Arg Glu Arg Leu Se - #r Ser Ser Ser Glu Lys     #               175     - Gly Asn Met Ile Met Glu Ser Ala Lys Glu Th - #r Arg Tyr Cys Ala Val     #           190     - Cys Asn Asp Tyr Ala Ser Gly Tyr His Tyr Gl - #y Val Trp Ser Cys Glu     #       205     - Gly Cys Lys Ala Phe Phe Lys Arg Ser Ile Gl - #n Gly His Asn Asp Tyr     #   220     - Met Cys Pro Ala Thr Asn Gln Cys Thr Ile As - #p Lys Asn Arg Arg Lys     225                 2 - #30                 2 - #35                 2 -     #40     - Ser Cys Gln Ala Cys Arg Leu Arg Lys Cys Ty - #r Glu Val Gly Met Met     #               255     - Lys Gly Gly Ile Arg Lys Asp Arg Arg Gly Gl - #y Arg Met Leu Lys His     #           270     - Lys Arg Gln Arg Asp Asp Leu Glu Gly Arg As - #n Glu Met Gly Thr Ser     #       285     - Gly Asp Met Arg Ala Ala Asn Leu Trp Pro Se - #r Pro Leu Val Ile Lys     #   300     - His Thr Lys Lys Asn Ser Pro Ala Leu Ser Le - #u Thr Ala Asp Gln Met     305                 3 - #10                 3 - #15                 3 -     #20     - Val Ser Ala Leu Leu Asp Ala Glu Pro Pro Le - #u Ile Tyr Ser Glu Tyr     #               335     - Asp Pro Ser Arg Pro Phe Ser Glu Ala Ser Me - #t Met Gly Leu Leu Thr     #           350     - Asn Leu Ala Asp Arg Glu Leu Val His Met Il - #e Asn Trp Ala Lys Arg     #       365     - Val Pro Gly Phe Gly Asp Leu Asn Leu His As - #p Gln Val His Leu Leu     #   380     - Glu Cys Ala Trp Leu Glu Ile Leu Met Ile Gl - #y Leu Val Trp Arg Ser     385                 3 - #90                 3 - #95                 4 -     #00     - Met Glu His Pro Gly Lys Leu Leu Phe Ala Pr - #o Asn Leu Leu Leu Asp     #               415     - Arg Asn Gln Gly Lys Cys Val Glu Gly Met Va - #l Glu Ile Phe Asp Met     #           430     - Leu Leu Ala Thr Ser Ser Arg Phe Arg Met Me - #t Asn Leu Gln Gly Glu     #       445     - Glu Phe Val Cys Leu Lys Ser Ile Ile Leu Le - #u Asn Ser Gly Val Tyr     #   460     - Thr Phe Leu Ser Ser Thr Leu Lys Ser Leu Gl - #u Glu Lys Asp His Ile     465                 4 - #70                 4 - #75                 4 -     #80     - His Arg Val Leu Asp Lys Ile Asn Asp Thr Le - #u Ile His Leu Met Ala     #               495     - Lys Ala Gly Leu Thr Leu Gln Gln Gln His Ar - #g Arg Leu Ala Gln Leu     #           510     - Leu Leu Ile Leu Ser His Ile Arg His Met Se - #r Asn Lys Gly Met Glu     #       525     - His Leu Tyr Asn Met Lys Cys Lys Asn Val Va - #l Pro Leu Tyr Asp Leu     #   540     - Leu Leu Glu Met Leu Asp Ala His Arg Leu Hi - #s Ala Pro Ala Ser Arg     545                 5 - #50                 5 - #55                 5 -     #60     - Met Gly Val Pro Pro Glu Glu Pro Ser Gln Se - #r Gln Leu Thr Thr Thr     #               575     - Ser Ser Thr Ser Ala His Ser Leu Gln Thr Ty - #r Tyr Ile Pro Pro Glu     #           590     - Ala Glu Gly Phe             595     - (2) INFORMATION FOR SEQ ID NO:17:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 591 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Homo sapi - #ens     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:     - Met Thr Met Thr Leu His Thr Lys Ala Ser Gl - #y Met Ala Leu Leu His     #                15     - Gln Ile Gln Gly Asn Glu Leu Glu Pro Leu As - #n Arg Pro Gln Leu Lys     #            30     - Ile Pro Leu Glu Arg Pro Leu Gly Glu Val Ty - #r Leu Asp Ser Ser Lys     #        45     - Pro Ala Val Tyr Asn Tyr Pro Glu Gly Ala Al - #a Tyr Glu Phe Asn Ala     #    60     - Ala Ala Ala Ala Asn Ala Gln Val Tyr Gly Gl - #n Thr Gly Leu Pro Tyr     #80     - Gly Pro Gly Ser Glu Ala Ala Ala Phe Gly Se - #r Asn Gly Leu Gly Gly     #                95     - Phe Pro Pro Leu Asn Ser Val Ser Pro Ser Pr - #o Ile Met Ile Leu His     #           110     - Pro Pro Pro Gln Leu Ser Pro Phe Leu Gln Pr - #o His Gly Gln Gln Val     #       125     - Pro Tyr Tyr Leu Glu Asn Glu Pro Ser Gly Ty - #r Thr Val Arg Glu Ala     #   140     - Gly Pro Pro Ala Phe Tyr Arg Pro Asn Ser As - #p Asn Arg Arg Gln Gly     145                 1 - #50                 1 - #55                 1 -     #60     - Gly Arg Glu Arg Leu Ala Ser Thr Asn Asp Ly - #s Gly Ser Met Ala Met     #               175     - Glu Ser Ala Lys Glu Thr Arg Tyr Cys Ala Va - #l Cys Asn Asp Tyr Ala     #           190     - Ser Gly Tyr His Tyr Gly Val Trp Ser Cys Gl - #u Gly Cys Lys Ala Phe     #       205     - Phe Lys Arg Ser Ile Gln Gly His Asn Asp Ty - #r Met Cys Pro Ala Thr     #   220     - Asn Gln Cys Thr Ile Asp Lys Asn Arg Arg Ly - #s Ser Cys Gln Ala Cys     225                 2 - #30                 2 - #35                 2 -     #40     - Arg Leu Arg Lys Cys Tyr Glu Val Gly Met Me - #t Lys Gly Gly Ile Arg     #               255     - Lys Asp Arg Arg Gly Gly Arg Met Leu Lys Hi - #s Lys Arg Gln Arg Asp     #           270     - Asp Gly Glu Gly Arg Gly Glu Val Gly Ser Al - #a Gly Asp Met Arg Ala     #       285     - Ala Asn Leu Trp Pro Ser Pro Leu Met Ile Ly - #s Arg Ser Lys Lys Asn     #   300     - Ser Leu Ala Leu Ser Leu Thr Ala Asp Gln Me - #t Val Ser Ala Leu Leu     305                 3 - #10                 3 - #15                 3 -     #20     - Asp Ala Glu Pro Pro Ile Leu Tyr Ser Glu Ty - #r Asp Pro Thr Arg Pro     #               335     - Phe Ser Glu Ala Ser Met Met Gly Leu Leu Th - #r Asn Leu Ala Asp Arg     #           350     - Glu Leu Val His Met Ile Asn Trp Ala Lys Ar - #g Val Pro Gly Phe Val     #       365     - Asp Leu Thr Leu His Asp Gln Val His Leu Le - #u Glu Cys Ala Trp Leu     #   380     - Glu Ile Leu Met Ile Gly Leu Val Trp Arg Se - #r Met Glu His Pro Val     385                 3 - #90                 3 - #95                 4 -     #00     - Lys Leu Leu Phe Ala Pro Asn Leu Leu Leu As - #p Arg Asn Gln Gly Lys     #               415     - Cys Val Glu Gly Met Val Glu Ile Phe Asp Me - #t Leu Leu Ala Thr Ser     #           430     - Ser Arg Phe Arg Met Met Asn Leu Gln Gly Gl - #u Glu Phe Val Cys Leu     #       445     - Lys Ser Ile Ile Leu Leu Asn Ser Gly Val Ty - #r Thr Phe Leu Ser Ser     #   460     - Thr Leu Lys Ser Leu Glu Glu Lys Asp His Il - #e His Arg Val Leu Asp     465                 4 - #70                 4 - #75                 4 -     #80     - Lys Ile Thr Asp Thr Leu Ile His Leu Met Al - #a Lys Ala Gly Leu Thr     #               495     - Leu Gln Gln Gln His Gln Arg Leu Ala Gln Le - #u Leu Leu Ile Leu Ser     #           510     - His Ile Arg His Met Ser Asn Lys Gly Met Gl - #u His Leu Tyr Ser Met     #       525     - Lys Cys Lys Asn Val Val Pro Leu Tyr Asp Le - #u Leu Leu Glu Met Leu     #   540     - Asp Ala His Arg Leu His Ala Pro Thr Ser Ar - #g Gly Gly Ala Ser Val     545                 5 - #50                 5 - #55                 5 -     #60     - Glu Glu Thr Asp Gln Ser His Leu Ala Thr Al - #a Gly Ser Thr Ser Ser     #               575     - His Ser Leu Gln Lys Tyr Tyr Ile Thr Gly Gl - #u Ala Glu Gly Phe     #           590     - (2) INFORMATION FOR SEQ ID NO:18:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 518 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Homo sapi - #ens     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:     - Met Gly Leu Glu Met Ser Ser Lys Asp Ser Pr - #o Gly Ser Leu Asp Gly     #                15     - Arg Ala Trp Glu Asp Ala Gln Lys Pro Gln Se - #r Ala Trp Cys Gly Gly     #            30     - Arg Lys Thr Arg Val Tyr Ala Thr Ser Ser Ar - #g Arg Ala Pro Pro Ser     #        45     - Glu Gly Thr Arg Arg Gly Gly Ala Ala Arg Pr - #o Glu Glu Ala Ala Glu     #    60     - Glu Gly Pro Pro Ala Ala Pro Gly Ser Leu Ar - #g His Ser Gly Pro Leu     #80     - Gly Pro His Ala Cys Pro Thr Ala Leu Pro Gl - #u Pro Gln Val Thr Ser     #                95     - Ala Met Ser Ser Gln Val Val Gly Ile Glu Pr - #o Leu Tyr Ile Lys Ala     #           110     - Glu Pro Ala Ser Pro Asp Ser Pro Lys Gly Se - #r Ser Glu Thr Glu Thr     #       125     - Glu Pro Pro Val Ala Leu Ala Pro Gly Pro Al - #a Pro Thr Arg Cys Leu     #   140     - Pro Gly His Lys Glu Glu Glu Asp Gly Glu Gl - #y Ala Gly Pro Gly Glu     145                 1 - #50                 1 - #55                 1 -     #60     - Gln Gly Gly Gly Lys Leu Val Leu Ser Ser Le - #u Pro Lys Arg Leu Cys     #               175     - Leu Val Cys Gly Asp Val Ala Ser Gly Tyr Hi - #s Tyr Gly Val Ala Ser     #           190     - Cys Glu Ala Cys Lys Ala Phe Phe Lys Arg Th - #r Ile Gln Gly Ser Ile     #       205     - Glu Tyr Ser Cys Pro Ala Ser Asn Glu Cys Gl - #u Ile Thr Lys Arg Arg     #   220     - Arg Lys Ala Cys Gln Ala Cys Arg Phe Thr Ly - #s Cys Ile Arg Val Gly     225                 2 - #30                 2 - #35                 2 -     #40     - Met Leu Lys Glu Gly Val Arg Leu Asp Arg Va - #l Arg Gly Gly Arg Gln     #               255     - Lys Tyr Lys Arg Arg Pro Glu Val Asp Pro Le - #u Pro Phe Pro Gly Pro     #           270     - Phe Pro Ala Gly Pro Leu Ala Val Ala Gly Gl - #y Pro Arg Lys Thr Ala     #       285     - Ala Pro Val Asn Ala Leu Val Ser His Leu Le - #u Val Val Glu Pro Glu     #   300     - Lys Leu Tyr Ala Met Pro Asp Pro Ala Gly Pr - #o Asp Gly His Leu Pro     305                 3 - #10                 3 - #15                 3 -     #20     - Ala Val Ala Thr Leu Cys Asp Leu Phe Asp Ar - #g Glu Ile Val Val Thr     #               335     - Ile Ser Trp Ala Lys Ser Ile Pro Gly Phe Se - #r Ser Leu Ser Leu Ser     #           350     - Asp Gln Met Ser Val Leu Gln Ser Val Trp Me - #t Glu Val Leu Val Leu     #       365     - Gly Val Ala Gln Arg Ser Leu Pro Leu Gln As - #p Glu Leu Ala Phe Ala     #   380     - Glu Asp Leu Val Leu Ile Glu Glu Gly Ala Ar - #g Ala Ala Gly Leu Gly     385                 3 - #90                 3 - #95                 4 -     #00     - Glu Leu Gly Ala Ala Leu Leu Gln Leu Val Ar - #g Arg Leu Gln Ala Leu     #               415     - Arg Leu Glu Arg Glu Glu Tyr Val Leu Leu Ly - #s Ala Leu Ala Leu Ala     #           430     - Asn Ser Asp Ser Val His Ile Glu Asp Glu Pr - #o Arg Leu Trp Ser Ser     #       445     - Cys Glu Lys Leu Leu His Glu Ala Leu Leu Gl - #u Tyr Glu Ala Gly Arg     #   460     - Ala Gly Pro Gly Gly Gly Ala Glu Arg Arg Ar - #g Ala Gly Arg Leu Leu     465                 4 - #70                 4 - #75                 4 -     #80     - Leu Thr Leu Pro Leu Leu Arg Gln Thr Ala Gl - #y Lys Val Leu Ala His     #               495     - Phe Tyr Gly Val Lys Leu Glu Gly Lys Val Pr - #o Met His Lys Leu Phe     #           510     - Leu Glu Met Leu Glu Ala             515     - (2) INFORMATION FOR SEQ ID NO:19:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 431 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Homo sapi - #ens     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:     - Met Ser Ser Glu Asp Arg His Leu Gly Ser Se - #r Cys Gly Ser Phe Ile     #                15     - Lys Thr Glu Pro Ser Ser Pro Ser Ser Gly Il - #e Asp Ala Leu Ser His     #            30     - His Ser Pro Ser Gly Ser Ser Asp Ala Ser Gl - #y Gly Phe Gly Met Ala     #        45     - Leu Gly Thr His Ala Asn Gly Leu Asp Ser Pr - #o Pro Met Phe Ala Gly     #    60     - Ala Gly Leu Gly Gly Asn Pro Cys Arg Lys Se - #r Tyr Glu Asp Cys Thr     #80     - Ser Gly Ile Met Glu Asp Ser Ala Ile Lys Cy - #s Glu Tyr Met Leu Asn     #                95     - Ala Ile Pro Lys Arg Leu Cys Leu Val Cys Gl - #y Asp Ile Ala Ser Gly     #           110     - Tyr His Tyr Gly Val Ala Ser Cys Glu Ala Cy - #s Lys Ala Phe Phe Lys     #       125     - Arg Thr Ile Gln Gly Asn Ile Glu Tyr Ser Cy - #s Pro Ala Thr Asn Glu     #   140     - Cys Glu Ile Thr Lys Arg Arg Arg Lys Ser Cy - #s Gln Ala Cys Arg Phe     145                 1 - #50                 1 - #55                 1 -     #60     - Met Lys Cys Ile Lys Val Gly Met Leu Lys Gl - #u Gly Val Arg Leu Asp     #               175     - Arg Val Arg Gly Gly Arg Gln Lys Tyr Lys Ar - #g Arg Leu Asp Ser Glu     #           190     - Asn Ser Pro Tyr Leu Ser Leu Gln Ile Ser Pr - #o Pro Ala Lys Lys Pro     #       205     - Leu Thr Lys Ile Val Ser Tyr Leu Leu Val Al - #a Glu Pro Asp Lys Leu     #   220     - Tyr Ala Met Pro Pro Asp Asp Val Pro Glu Gl - #y Asp Ile Lys Ala Leu     225                 2 - #30                 2 - #35                 2 -     #40     - Thr Thr Leu Cys Asp Leu Ala Asp Arg Glu Le - #u Val Phe Leu Ile Ser     #               255     - Trp Ala Lys His Ile Pro Gly Phe Ser Asn Le - #u Thr Leu Gly Asp Gln     #           270     - Met Ser Leu Leu Gln Ser Ala Trp Met Glu Il - #e Leu Ile Leu Gly Ile     #       285     - Val Tyr Arg Ser Leu Pro Tyr Asp Asp Lys Le - #u Ala Tyr Ala Glu Asp     #   300     - Tyr Ile Met Asp Glu Glu His Ser Arg Leu Va - #l Gly Leu Leu Glu Leu     305                 3 - #10                 3 - #15                 3 -     #20     - Tyr Arg Ala Ile Leu Gln Leu Val Arg Arg Ty - #r Lys Lys Leu Lys Val     #               335     - Glu Lys Glu Glu Phe Val Met Leu Lys Ala Il - #e Ala Leu Ala Asn Ser     #           350     - Asp Ser Met Tyr Ile Glu Asn Leu Glu Ala Va - #l Gln Lys Leu Gln Asp     #       365     - Leu Leu His Glu Ala Leu Gln Asp Tyr Glu Le - #u Ser Gln Arg His Glu     #   380     - Glu Pro Arg Arg Ala Gly Lys Leu Leu Leu Th - #r Leu Pro Leu Leu Arg     385                 3 - #90                 3 - #95                 4 -     #00     - Gln Thr Ala Ala Lys Ala Val Gln His Phe Ty - #r Ser Val Lys Leu Gln     #               415     - Gly Lys Val Pro Met His Lys Leu Phe Leu Gl - #u Met Leu Glu Ala     #           430     __________________________________________________________________________ 

We claim:
 1. An isolated polypeptide displaying the biological activity of ERβ, said polypeptide having the amino acid sequence of SEQ ID NO:2, SEQ ID NO:3, or SEQ ID NO:5.
 2. An isolated nucleic acid which codes for a polypeptide displaying the biological activity of ERβ, said polypeptide having the amino acid sequence of SEQ ID NO:2, SEQ ID NO:3, or SEQ ID NO:5.
 3. The isolated nucleic acid according to claim 2, said nucleic acid having the nucleic acid sequence of SEQ ID NO:1, SEQ ID NO:4, or SEQ ID NO:6.
 4. A method for identifying a molecule which binds with ERβ, comprising the steps of:(a) contacting an isolated polypeptide displaying the biological activity of ERβ, said polypeptide having the amino acid sequence of SEQ ID NO:2, SEQ ID NO:3 or SEQ ID NO:5, with a molecule in vitro, under conditions which permit said polypeptide to bind to said molecule; and (b) detecting the binding of said polypeptide with said molecule.
 5. An isolated polypeptide displaying the biological activity of ERβ, said polypeptide having the amino acid sequence of SEQ ID NO:2.
 6. An isolated nucleic acid which codes for a polypeptide displaying the biological activity of ERβ, said polypeptide having the amino acid sequence of SEQ ID NO:2.
 7. The isolated nucleic acid according to claim 6, said nucleic acid having the nucleic acid sequence of SEQ ID NO:1. 